THE CHARACTER AND DEGREE OF PSYCHOPATHOLOGY IN 
RELATION TO SUSTAINED INTELLECTUAL PERFORMANCE 


ROBERT ROESSLER AND LEIGH ROBERTS 
University of Wisconsin Medical School 


PROBLEM 

Psychiatrists are repeatedly confronted with patient complaints of impaired 
intellectual functioning. This is particularly true in a college setting, where student 
patients almost universally complain of inability to concentrate, lowered academic 
performance and anxiety over examinations. It is probably safe to predict that most 
psychiatrists, asked to formulate hypotheses concerning the effect of psychopath- 
ology on intellectual performance, would predict impaired performance proportion- 
ate to the character and degree of pathology. The present study is designed to test 
these subjective impressions on a college population. 

A review of the literature reveals two groups of pertinent studies. One group, 
dealing with the effect of variation in external stress on intellectual performance, 
has been reviewed comprehensively by Lazarus, et al. Most of this work suffers from 
an almost total neglect of variables which are a function of the internal organization 
of the individuals tested. Consequently, individual differences tend to be one of the 
main findings in studies of this type. 

The second group of studies, alert to the foregoing error, have attempted to correlate 
intellectual performance with internal variables as defined by various psychiatric 
groups. Hunt has summarized these findings up to 1944. Greenfield, more recently, 
compared a group of college normals with a group of neurotic patients with respect 
to their performance on certain intellectual tasks while “frustrated” and found no 
significant differences between the two groups. Fey found 
clearcut differences between schizophrenics and controls on the Wisconsin Card 
Sorting Test under different conditions of personal, impersonal, and physical stress. 
In general, in the studies of this type, schizophrenics are the only pathological group 
who tend to show consistent differences from controls on intellectual tasks. All of 
these studies were carried out under laboratory conditions and, to the clinician, there 
may be considerable danger in generalizing the conclusions reached in them to be- 
havior outside the laboratory situation. Moreover, the tests of performance were, 
in every instance, sharply discrete temporally, i.e., the performances were measured 
only within relatively brief spans of time. To the present authors’ knowledge only 
one group of investigators has heretofore measured intellectual performance in a 
psychopathological group outside the laboratory and over a relatively extended period. 
Matarazzo, et al. have correlated academic performance over a year’s period 
with various degrees of anxiety and found no significant correlation. The present 
study utilizes the same technique on a broader psychopathological sample. 


METHOD 

The population selected for study was all of the undergraduate students applying 
for psychiatric assistance in the Student Health Psychiatry Section of the Department 
of Psychiatry of the University of Wisconsin in the four year period of 
1950-1953 inclusive. From this group all those with no psychiatric diagnosis were 
excluded, leaving a population of over 1000. From this population, in turn, the following 
five experimental groups were randomly selected: (a) 122 patients without 
regard to diagnosis, (b) 39 personality disorders, characterized by “acting out” behavior, 
(c) 44 obsessive-compulsive neurotics, (d) 43 incipient psychotics and psychotics, 
and (e) 59 persons diagnosed as “adolescent reaction”, in whom it was felt 
by the examiners there was no serious pathology. On each of these samples the mean 
academic grade-point of the group was calculated. The mean period on which the 
grade-point was calculated for the experimental groups varied from 2.8 to 3.6 aca- 
demic years. A comparison of the variances within each group was made. The mean 
grade-point of each of these samples was then compared with the mean grade-point 
of 100 unselected persons from the total population of the university during the same 
period. In addition, where significant differences were found, the influence of in- 
telligence in accounting for the difference was tested by comparing the experimental 
group with the control group on the mean score achieved on the American Council on 
Education Test. 
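
The comparison of an experimental group's mean grade-point with that of the control sample can be illustrated with a two-sample test. The paper does not name the statistic that was used, so the following is only a minimal sketch that assumes Welch's unequal-variance t. The function name `welch_t` and the grade-point values are hypothetical and are not data from the study.

```python
import math
from statistics import mean, variance


def welch_t(sample_a, sample_b):
    """Return (t, df) for Welch's unequal-variance two-sample t-test."""
    na, nb = len(sample_a), len(sample_b)
    va, vb = variance(sample_a), variance(sample_b)  # sample variances (n - 1)
    se2 = va / na + vb / nb                          # squared standard error
    t = (mean(sample_a) - mean(sample_b)) / math.sqrt(se2)
    # Welch-Satterthwaite approximation to the degrees of freedom
    df = se2 ** 2 / ((va / na) ** 2 / (na - 1) + (vb / nb) ** 2 / (nb - 1))
    return t, df


# Hypothetical grade-point values for illustration only (not study data).
clinic_group = [2.9, 3.1, 2.7, 3.4, 3.0, 3.2]
control_group = [2.5, 2.8, 2.6, 2.9, 2.4, 2.7]

t, df = welch_t(clinic_group, control_group)
print(f"t = {t:.2f}, df = {df:.1f}")
```

The resulting t would be referred to a t table at the chosen level; the same routine could be applied to the intelligence test scores of the two groups.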


HYPOTHESES 

For statistical purposes all hypotheses were framed in terms of the null hypothesis, 
i.e., that there would be no significant differences when each of the experimental 
samples was compared with the normal control sample. However, the authors’ 
hypotheses may be framed in positive terms as follows: 

(1) Obsessive-compulsives will perform significantly better academically than 
will a random sample of the total student population. This expectation was based 
upon the personality features of intellectualization, compulsiveness, and strong 
achievement needs which characterize this group of disorders. 

(2) Personality disorders characterized by “acting out” will perform signi- 
ficantly poorer than the sample of the total student population. This was anticipated 
because it was felt that this group, under the stress of impending examinations, 
would discharge their anxiety tangentially and would tend to study less and value 
academic achievement less than would a group of normals. 

(3) Psychotics and incipient psychotics will perform significantly poorer than 
“normals”. This was anticipated because of the clinical impressions of interference 
with thought processes, overwhelming anxiety, and behavioral disorganization. 

(4) Adolescent reactions will not differ significantly from “normals”. It was felt 
that this group, being minimally sick from the psychiatric frame of reference, would 
be roughly equatable to “normals”. 


(5) A random sample of the total psychopathological population will not differ 
from “normals”. This was anticipated because of the earlier hypotheses that performance 
in the psychopathological groups would be enhanced in some instances and 
impaired in others so that these opposing trends would cancel out differences in the 
total group. 


RESULTS 

The null hypothesis cannot be rejected except for the obsessive-compulsive 
group. The latter group performed significantly better than the normal controls and 
this difference was significant at better than the 1% level of confidence. When the 
obsessive-compulsive group was compared with the normal on the mean performance 
on the American Council on Education Test, no significant difference was found; 
hence, greater intelligence cannot account for the enhanced performance of this 
group. Analysis of variance within the groups revealed no significant differences. 


DISCUSSION 


Certain limitations in the design of this study should be noted. First, the path- 
ological populations concerned are those who sought assistance; whether similar 
pathological populations who did not seek help would differ cannot be answered. 
Second, it might be argued that decrements in academic performance might have 
been revealed had the period of measurement been restricted to the period of clinical 
contact, during which the patients concerned were presumably most anxious. However, 
Matarazzo, et al. have previously demonstrated that anxiety bears no relationship 
to intellectual performance of this character. 

With the foregoing reservations, the results of this study tend to confirm the 
results of laboratory experimentation, i.e., that character and degree of psychopathology 
are not predictive of intellectual performance. The exception to this generalization 
in this study, that of the obsessive-compulsive group, is actually in the 
direction of enhanced performance. For the authors, the finding that the existence 
of psychosis or incipient psychosis and “acting out” personality disorders does not 
impair academic performance is surprising and contrary to subjective clinical im- 
pressions. How can one view such results most usefully? 

Hartman and Rapaport, in discussing the sparing of certain ego functions in 
psychopathology, have proposed the concept of “conflict-free ego spheres”. In terms 
of this hypothesis it might be speculated that the intellectual performances measured 
in this study are at least comparatively free of conflict and spared the encroachment 
which is attendant to other areas of behavior in emotionally sick persons. Such an 
explanation, however, is not congruent with innumerable reports by patients in this 
study of conflict attendant to academic performance of the obsessive-compulsive 
group. 

Greenfield has made similar observations on the concept of “conflict-free” 
ego spheres in noting the importance of such non-intellectual factors as motivation 
and attention in performing intellectual tasks. He has proposed the terms “syntonic 
ego-involvement” and “dystonic ego-involvement” to designate respectively 
those behaviors which are or are not consonant with the standards of the ego. This 
theory is certainly attractive when applied to the obsessive-compulsive group, 
where its implication is that studying and examination behaviors are more syn- 
tonic to the obsessive-compulsive ego than to the “normal” ego. This interpreta- 
tion is in keeping with clinical observation that for the obsessive-compulsive per- 
sonality, repetitious self-assessment is an important defense. Moreover, successful 
examination performance probably places a premium upon personality attributes 
characteristic of the obsessive-compulsive: ritualistic repetition (of facts to be 
memorized), obsessive concern for detail, great skills in conceptualization, abstrac- 
tion and verbalization. Finally, the value placed on academic achievement by the 
obsessive-compulsive seems clinically to be greater than the value to “normals”; such 
value is probably important in motivating study. 

When this theory is applied to the remaining pathological groups, its usefulness 
is questionable. While the behaviors measured might, in part, be as useful defensive- 
ly to these groups as to the obsessive-compulsives and hence ego-syntonic, the clini- 
cian would anticipate a priori from the reports of patients that they are largely 
dystonic and threatening; hence it might be expected that performance would be 
impaired. We can only speculate that, while the threat of these behaviors may be 
anxiety-provoking, the anxiety provoked is not sufficient to disorganize behavior; 
indeed, it may be facilitatory in some degree. Were the variability of performance in 
these groups significantly greater than in the normal, it would tend to lend some sup- 
port to this view. Unfortunately, this cannot be demonstrated in this study. 

The authors believe that this work underscores the need for further objective 
delineation of the ways in which ego functions are affected by psychopathology. It is 
all too easy for the clinician to mistake phenomenologic patient reports for objective- 
ly verifiable behavior. While the usefulness of such information to the clinician may 
be minimal (concerned as he is largely with experienced distress in his patients, not so 
greatly with their behavior per se) it may be helpful to other groups necessarily concerned 
with the psychology of man. Educators, for example, may well pause before 
assuming too quickly that psychopathology, like some organic pathology, constitutes 
an adequate explanation for lowered academic performance. 


SUMMARY 

This paper reports the effects of psychopathology in a college undergraduate 
population on scholastic performance over a period of several years. Five experi- 
mental groups were randomly selected from among undergraduate students seen in 
the Student Health Psychiatry Section of the University. These were classified as 
personality disorders characterized by “acting out” behavior, obsessive-compulsive 
neurotics, “adolescent reactions” in whom it was felt there was no serious pathology, 
incipient psychotics and overt psychotics as well as a group selected without regard 
to diagnosis. Comparisons of the scholastic averages of these groups were made with 
averages of a randomly selected group of students attending the university during 
the same period of time. 

The obsessive-compulsive neurotics did much better academically than did the 
random sample of the total student population with this difference statistically sig- 
nificant at better than the 1% level of confidence. The intelligence of these two 
groups was compared without finding significant difference between them. Com- 
parison of the academic performance of the other experimental groups with the total 
student sample did not reveal any significant differences in scholastic achievement. 

With the exception of the obsessive-compulsive group the findings tend to confirm 
the generalization derived from laboratory experiments that intellectual performance 
seems to be little affected by certain types of psychopathology. The personality 
characteristics of the obsessive-compulsive individual appear to be syntonic 
with intellectual achievement of an academic type and the findings are thus com- 
patible with clinical observation. The subjective complaints of impaired intellectual 
functioning by members of the other groups are not borne out by objective measure- 
ment of their academic accomplishments. Theories are presented relating to sparing 
of some ego functions in emotional illness to explain these findings but their relative 
lack of usefulness points out the need for further objective study of the ways in 
which ego functions are affected by psychopathology. 


SPONTANEOUS FLUCTUATIONS IN THE SEVERITY OF ILLNESS 
OF HOSPITALIZED LOBOTOMIZED PATIENTS¹ 


THOMAS E. HANLON, CAROLYN S. SHEETS AND ALBERT A. KURLAND 
Spring Grove State Hospital, Baltimore, Maryland 


PROBLEM 


The present study was undertaken to answer the following questions relative to 
the observable behavior of lobotomized patients. Is there a significant variability 
over a prolonged period of time in the severity of the mental illnesses of hospitalized 
lobotomized patients who receive no therapy other than routine custodial care? 
When fluctuation is evidenced in the psychiatric status of a patient, to what personality 
components may we attribute this fluctuation? Do those lobotomized patients 
who show high fluctuation in behavioral ratings have certain identifying characteristics 
in common, and how do these compare with the characteristics of those who 
show low fluctuation? 

¹It is with a great deal of appreciation that acknowledgement is made of the authors’ fruitful discussions 
with Dr. Maurice Lorr of the Veterans Administration and Dr. James P. O’Connor of the 
Catholic University of America concerning this project. It is also with a great deal of appreciation 
that acknowledgments are made to the Friends of Psychiatric Research, Inc., and the Women’s Auxiliary 
of Spring Grove State Hospital, Inc., for their financial and volunteer assistance. A final note of 
appreciation is due to Dr. Isadore Tuerk, Superintendent of Spring Grove State Hospital, for his encouragement 
and interest in this project. 


METHOD 


In a previous study by Kurland, et al. the status of 122 patients who had been 
lobotomized over a ten year period at a typical state hospital was reviewed. At the 
time of this present study there were 80 patients of this group remaining in the hos- 
pital, who had either never left the hospital after their lobotomies or who had been 
unable to maintain a satisfactory adjustment outside of the hospital setting for any 
appreciable amount of time. All were considered treatment failures. Thirty-one of 
these cases, who had been on no treatment of any kind for a period of at least three 
months, were chosen as subjects for this study. All of these 31 patients had lobo- 
tomies performed at least a year prior to the study and had been hospitalized for 
periods exceeding two years. The age range of the sample was from 29 to 72 years, 
with a mean age of 46 years and a standard deviation of ten years. 

The 31 patients comprising the sample were evaluated at approximately month- 
ly intervals over a six month period with the Multidimensional Scale for Rating 
Psychiatric Patients (MSRPP), which includes both ward and interview behavior 
observations. During this period these patients remained in the same environmental 
setting and received only routine custodial care. For each patient the MSRPP pro- 
vided a total morbidity score, separate factor scores for the 11 factors comprising the 
scale, and three nosological pattern scores: hebephrenic, catatonic and paranoid. 
The average inter-rater reliability on the MSRPP for the three raters involved was 
.89. The reliability coefficients were determined on an independent, related sampling. 


RESULTS AND DISCUSSION 


The severity of illness for the observed population according to mean MSRPP 
morbidity scores for the six rating situations ranged from a low of 19.3 for the best 
adjusted individual to a high of 73.7 for the least adjusted, with a mean score of 43.7 
for the group and a standard deviation of 15.4. The mean score value is such that a 
patient obtaining this score would be placed at the 80th percentile when compared 
to the normative population for the MSRPP, i.e., 80 percent of typical psychiatric 
hospital patients in the normative group obtained scores indicative of less morbidity. 
Two other total scores on the MSRPP representative of degree of mental illness, the 
“normality” and “difference” scores (similarity and dissimilarity of total pattern 
from what might be expected of normal subjects), were also obtained for each sub- 
ject. However, since the correlations between the means of these scores and the 
mean morbidity scores were .91 and .94 respectively, they were disregarded in favor 
of the latter. The group mean morbidity scores for the six evaluations, together with 
t ratios and correlations for the five between-rating situations may be found in 
Table 1. 


TABLE 1. THE SIGNIFICANCE OF THE DIFFERENCES BETWEEN THE MEANS AND THE PEARSONIAN r’s 
FOR THE SIX RATING SITUATIONS 

[Table body mostly illegible in the source. Legible entries, in source order: .42; t* .82, .37; r** .95, .94] 

*None significant (2.04 needed for significance at the .05 level). 
**All significant at the .001 level. 

As can be seen, the morbidity scores for the group are relatively stable over the 
six rating situations. None of the mean differences between successive ratings are 
statistically significant, while all of the correlations fall beyond the .001 level of sig- 
nificance. In a correlation matrix of the fifteen possible intercorrelations between 
ratings, none was found to fall below .80. 

Bartlett’s test for homogeneity of variance was performed for the six rating situ- 
ations prior to the application of analysis of variance techniques in the determination 
of total mean variability. A resulting chi square of 1.57 showed that the variances 
were within limits of random sampling from a population with common variance so 
that no transformation of scores was necessary. An analysis of variance method ac- 
counting for the use of the same subjects over several testings was used to determine 
the significance of inter-rating mean variability. The results of this analysis are 
presented in Table 2. The F ratio for between-ratings is 1.54, which is below the 
value of 2.27 needed for significance at the .05 level. The hypothesis of random 
sampling of the six mean scores from a common population is therefore upheld, i.e., 
there is no significant inter-rating variability in morbidity for the group over the six 
rating sessions. (On the other hand, the extreme heterogeneity of the group with re- 
gard to morbidity may be noted by the highly significant F ratio for the between- 
patients variability.) When these same analysis of variance techniques were applied 
to the 11 factor and three pattern scores, in no case was the inter-rating mean var- 
iability found to be significant. 


TABLE 2. ANALYSIS OF VARIANCE OF THE SCORES OF THE SAME SUBJECTS RATED AT SIX DIFFERENT 
MONTHLY PERIODS BY THE MSRPP 

Source of Variation     Sum of Squares     df     Mean Squares       F 

Between Ratings               272.23        5          54.45       1.54 
Between Patients            43943.38       30        1464.78      41.31 
Residual (error)             5318.37      150          35.46 

Total                       49533.98      185 
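
The arithmetic behind Table 2 can be retraced from the printed sums of squares. The sketch below assumes that the degrees of freedom follow from the 6 rating sessions and 31 patients described in the text (5, 30, and 150); the variable names are illustrative only.

```python
# Sums of squares as printed in Table 2.
ss = {"ratings": 272.23, "patients": 43943.38, "residual": 5318.37}

# Degrees of freedom inferred from the design: 6 ratings of 31 patients.
n_ratings, n_patients = 6, 31
df = {
    "ratings": n_ratings - 1,
    "patients": n_patients - 1,
    "residual": (n_ratings - 1) * (n_patients - 1),
}

# Mean square = sum of squares / degrees of freedom.
ms = {source: ss[source] / df[source] for source in ss}

# Each F ratio uses the residual mean square as the error term.
f_ratings = ms["ratings"] / ms["residual"]
f_patients = ms["patients"] / ms["residual"]

print(f"Between ratings:  MS = {ms['ratings']:.2f}, F = {f_ratings:.2f}")
print(f"Between patients: MS = {ms['patients']:.2f}, F = {f_patients:.2f}")
print(f"Residual:         MS = {ms['residual']:.2f}")
```

The between-ratings ratio reproduces the value of 1.54 reported in the text, and the much larger between-patients ratio corresponds to the heterogeneity of the group noted there.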





Although there was no marked group trend in mean variability over the six 
rating sessions the fact that there was considerable random individual variability was 
evident from a cursory analysis of the tabulated morbidity scores for each member 
of the group. A more detailed approach to individual variability was therefore under- 
taken in order to gain insight into the personality components associated with these 
individual fluctuations. An index of fluctuation from one rating to another was de- 
termined for each patient by averaging two rank order methods, one considering 
primarily the patient’s own variability over the six testing periods and the other 
taking into account his relative standing among the total group differences for each 
of the five between-rating intervals, since the mean differences and their distributions 
varied with each interval. The rank order correlation between the two methods 
was .75. 
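
The text does not give the exact formulas for the two rank-order methods, so the following is only a rough sketch of one way such an index might be built. It assumes that the first method ranks patients by the standard deviation of their own six scores and that the second ranks them by their average standing on absolute change within each of the five intervals. The names `rank` and `fluctuation_index` and the ratings shown are hypothetical.

```python
from statistics import mean, pstdev


def rank(values):
    """Ranks with 1 = smallest; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def fluctuation_index(scores):
    """scores: one list of successive morbidity ratings per patient."""
    # Method 1 (assumed): rank by each patient's own variability.
    own = rank([pstdev(s) for s in scores])
    # Method 2 (assumed): rank within each interval, then rank the mean standing.
    n_intervals = len(scores[0]) - 1
    per_interval = [
        rank([abs(s[t + 1] - s[t]) for s in scores]) for t in range(n_intervals)
    ]
    standing = [mean([r[p] for r in per_interval]) for p in range(len(scores))]
    relative = rank(standing)
    # The index is the average of the two rankings.
    return [(a + b) / 2 for a, b in zip(own, relative)]


# Hypothetical ratings for four patients (not study data).
ratings = [
    [40, 41, 40, 42, 41, 40],
    [60, 48, 66, 52, 70, 55],
    [30, 30, 31, 30, 30, 31],
    [50, 55, 47, 53, 49, 56],
]
index = fluctuation_index(ratings)
print(index)
```

A higher index marks a patient whose ratings shift more from one session to the next, both in absolute terms and relative to the rest of the group.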

Eight (approximately 27%) of the 31 patients were then identified as “high 
fluctuators” and eight as “low fluctuators” according to their mean fluctuation rankings. 
Mean comparisons were made for these extreme groups on the MSRPP total 
morbidity score, the 11 factors underlying this score and the three nosological pat- 
tern scores. So that the entire sample might be considered, rank-order correlations 
were also determined between these scores and the fluctuation ranks of all the 31 
patients (Table 3). 

From the significantly high rank-order correlation of .56 between total morbidity 
and fluctuation and the correspondingly high t value, it is apparent that those 
patients who have the greatest degree of mental illness also tend to show the greatest 
degree of variability in behavior from one rating situation to another. From this, it 
would seem that any experimentation on the evaluation of behavioral or personality 
change in psychiatric patients should consider the initial differentiating characteristic 
of the severity of the mental illness itself, since the significance of a purported 
change is indirectly dependent on the initial status of the patient at the outset. Precautionary 
measures to control this variable are especially important when either 
single cases or very small samples are involved. 

TABLE 3. THE SIGNIFICANCE OF THE DIFFERENCES BETWEEN HIGH AND LOW FLUCTUATORS ON THE 
MSRPP TOTAL, FACTOR AND PATTERN SCORES, AND THE RANK-ORDER CORRELATION 
BETWEEN FLUCTUATION AND THESE SCORES 

Score                              Rho† with Fluct. 

MSRPP Total Morbidity                   .56 
Paranoid Pattern                       -.36* 
Hebephrenic Pattern 
Catatonic Pattern                       .50*** 
Depressed-manic 
Compliance-resistiveness                .45** 
Paranoid Projection 
Inactive-overactive 
Melancholy agitation 
Perceptual distortion 
Motor Disturbances 
Submissiveness-belligerence 
Withdrawal†† 
Self Depreciatory-grandiose 
Conceptual disorganization 

[The mean, difference, and remaining rho entries are illegible in the source. Legible fragments: Paranoid Pattern .86; Withdrawal 12.40; Self Depreciatory-grandiose 9.60; Conceptual disorganization 13.40.] 

*Significant at the .05 level 
**Significant at the .01 level 
***Significant at the .001 level 
†Using 1.04(1 − ρ²)/√(N − 1) as the standard error in determining significance, N = 31. 
††Lower score indicative of greater withdrawal. 
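
The standard error given in the footnote to Table 3, 1.04(1 − ρ²)/√(N − 1) with N = 31, can be applied to the legible rank-order correlations as a check. The sketch below assumes that the ratio of rho to its standard error is referred to two-tailed normal cutoffs (1.96, 2.58, 3.29), which the source does not state explicitly; the function names are hypothetical.

```python
import math


def critical_ratio(rho, n):
    """rho divided by the footnote's standard error 1.04(1 - rho^2)/sqrt(n - 1)."""
    standard_error = 1.04 * (1 - rho ** 2) / math.sqrt(n - 1)
    return rho / standard_error


def significance_level(ratio):
    """Strictest two-tailed normal level reached (assumed cutoffs), else None."""
    for cutoff, level in [(3.29, ".001"), (2.58, ".01"), (1.96, ".05")]:
        if abs(ratio) >= cutoff:
            return level
    return None


N = 31
# Rank-order correlations that remain legible in the text and in Table 3.
legible_rhos = {
    "Total morbidity": 0.56,
    "Catatonic pattern": 0.50,
    "Compliance-resistiveness": 0.45,
    "Paranoid pattern": -0.36,
}

for name, rho in legible_rhos.items():
    ratio = critical_ratio(rho, N)
    print(f"{name:26s} rho = {rho:+.2f}  ratio = {ratio:+.2f}  "
          f"level = {significance_level(ratio)}")
```

Under these assumptions the levels obtained for the catatonic, compliance-resistiveness, and paranoid entries agree with the asterisks that survive in the table.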

From Table 3, it also may be seen that degree of fluctuation is related most 
highly to those factors characteristic of psychotics, specifically those schizophrenic 
processes of withdrawal (Factor I, and aspects of the catatonic pattern) and person- 
ality disorganization as evidenced in thinking, perception, and motor coordination 
(Factors F, K, and G and the general pattern of hebephrenia). However, there is a 
negative relationship found between fluctuation and the schizophrenic processes of 
paranoid reorganization and stability as evidenced on the paranoid pattern score. 
Of those factors of temperament which are observable in nonpsychotics, as well as 
psychotics (B, D, and H) only the resistive end of the bipolar Factor B, compliance- 
resistiveness, is significantly related to the amount of variability from one rating to 
another. Both of the affective factors (the excitement end of the bipolar Factor A 
and melancholy agitation, E) show slight, but not statistically significant, relation- 
ships to fluctuation. Only Factor E significantly differentiates the high and low 
fluctuating groups. The underlying component of this factor is depressive agitation. 
The other scores which significantly differentiate the two groups are the total morbid- 
ity score, the three nosological patterns, and Factors B, F, G, I, and K. 

From the relationships described above, the clinical pictures of lobotomized 
patients with high and low behavioral fluctuations may be differentiated along the 
following lines. A patient whose hospital behavior is found to be erratic and unpre- 
dictable is likely to be one whose total personality makeup has become so disorgan- 
ized as a result of his illness that there is an extreme amount of distortion in thinking 
and perception and a corresponding, generalized disturbance in motor control. More- 
over, there is likely to be disorientation in any one, or all, of the three spheres. With- 
drawal tendencies tend to be such that there are little or no interpersonal relation- 
ships maintained; contacts with others in the hospital are usually of a resistive and 
uncooperative nature. Finally, the affective state of such a patient is likely to be 
characterized by an hypertension stemming from underlying depressive elements. 
The less disorganized patient tends to maintain a stationary clinical course and 
has been identified as a “low-fluctuator” in this study. On the whole, he has achieved 
a certain amount of reorganization and stability in his thinking and exhibits a min- 
imal amount of psychotic, regressive behavior. The orientation of such a patient 
tends to be maintained, and comparatively adequate interpersonal relationships 
allow him to enjoy a fairly harmonious hospital adjustment. It is of interest to note 
the obvious similarities in stability of behavior between our typical low fluctuator and 
the S-3 schizophrenic reaction type identified by Beck and elaborated upon by 
Molish et al. in their theoretical study of the treatment failure. 


SUMMARY AND CONCLUSIONS 


The behavior of 31 hospitalized lobotomized schizophrenics was objectively 
measured at monthly intervals for a period of six months by means of the MSRPP. 
During this period the patients remained in their initial ward settings and received 
only routine custodial care. Inter-rating mean variability was determined for the 
MSRPP’s total morbidity score, 11 factor scores, and three nosological pattern 
scores. The amount of individual variability was determined for each patient and 
the relationship between this variable and MSRPP scores studied. From an analysis 
of the results obtained the following conclusions are drawn: 


1. The degree of mental illness for a group of schizophrenic patients, still hos- 
pitalized after being subjected to a variety of psychiatric treatments, including psy- 
chosurgery, remains relatively stable over a long term period. Although no group 
trends in behavior are discernible, there is much individual random variation to be 
noted. 


2. Individual random fluctuation in hospital behavioral status for this group 
seems to be related to the severity of mental illness: with greater severity there is 
more variability. Future experimentation on the evaluation of change in the be- 
havior of psychiatric patients needs to consider this as a differentiating characteristic 
in sample populations. 


3. Degree of fluctuation is related most highly to the schizophrenic processes 
of withdrawal and personality disintegration as evidenced in thinking, perception and 
motor coordination. Resistiveness, as an aspect of temperament, and melancholy 
agitation are also related to behavioral fluctuations. 


4. The restitutional factors evidenced in the schizophrenic processes of para- 
noid reorganization and stability are negatively related to spontaneous, random 
changes in the status of certain psychiatric patients. 


DIFFERENCES IN SOME PERSONAL CHARACTERISTICS OF 
“RIGID” AND “FLEXIBLE” INDIVIDUALS 


K. WARNER SCHAIE 
University of Nebraska¹ 


PROBLEM 


Much has been made in recent clinical discussions of the “Rigidity-Flexibility” 
dimension as an important behavior attribute. A great deal of research has been 
concerned with defining the single or multi-dimensional nature of this concept and a 
variety of tests have been constructed to provide operations for its measurement.² 
Several definitions have been proposed for specific or more general aspects of classes 
of behavior which are all related to difficulty or ease in adjusting to change in stimu- 
lus patterns and whose psychological characteristics are expressed in terms of a 
central concept of rigid and flexible behavior. In an earlier paper the following de- 
finition, which is believed to summarize most of the concepts advanced, was found 
to be useful. It defined behavioral rigidity as: 


... a tendency to perseverate and resist conceptual change, to resist the acquisition of new 
patterns of behavior, and to refuse to relinquish old and established patterns. 


This definition may seem rather broad, but a set of behavioral consequences can 
readily be deduced which may then be made amenable to observational test. One 
may deduce, for example, that a rigid person dislikes to change patterns of interpersonal 
relationships; has difficulty in changing from one activity to another; is less 
efficient in activities requiring alternation and continuous readjustment of motor 
activities, etc. 

It was shown elsewhere that a battery of tests could be constructed to cover a 
variety of operations subsumed under the above definition. Subsequent to a series of 
factor analyses, it was also possible to show that the rigidity-flexibility concept can 
best be described by a three-dimensional system. In this system a “motor-cognitive 
rigidity” factor was seen to be a measure of effective adjustment to shifts in familiar 
patterns and to continuously changing situational demands; a “personality-perceptual 
rigidity” factor seemed to measure changes in cognitive and environmental patterns 
and new situations; and a “psychomotor speed” factor was considered to be related 
to the individual’s functional efficiency in coping with familiar situations requiring 
rapid response and quick thinking. 

An attempt was made to relate scores on these factors to other behavior var- 
iables and it was found that a linear combination of the three factor scores was most 
successful in discriminating between the “‘rigid”’ and ‘‘flexible’”’ groups. © The present 
paper is concerned, therefore, with reporting some of the differences in the personal
characteristics of subjects who were extreme high and low scorers on the composite 
rigidity measure. 

PROCEDURE 


Schaie’s Test of Behavioral Rigidity (TBR) “: *) was administered to a sample 
of 500 subjects. The Ss had been selected by a stratified-random sampling procedure 
from a larger sampling universe and their characteristics (reported in “) suggest 
that they are probably quite representative of the upper 75% of the American socio- 
economic continuum. Ss ranged in age from 20 to 70 years and were divided into ten 
five-year interval sub-groups containing 25 men and 25 women each. In addition to a
variety of psychological tests, Ss also filled out a schedule giving some of their per- 
sonal characteristics. After these data had been punched on IBM cards, all age 


¹This paper is based on part of a dissertation submitted in partial satisfaction of requirements for
the Ph.D. degree at University of Washington, Seattle. The author is indebted to Professors Charles 
R. Strother and Paul Horst for their advice and to the staff of Group Health Cooperative of Puget 
Sound (Seattle) for their assistance in obtaining the sample of subjects. 

*Surveys of the literature on different aspects of the concept of “‘rigidity’’ may be found in papers 
by Werner‘? and by Levine); see also“). 
groups were separately rank-ordered on the Composite Rigidity score. The ten most
rigid and the ten most flexible subjects in each age group were then selected for 
further study. Matching the extreme rigid and flexible individuals by age groups was 
required by the fact that rigidity-flexibility, just as many other behavior character- 
istics, shows significant age changes.“*) This method also provides more adequate 
sampling and permits wider generalization since the major portion of the adult age 
range is considered. 
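The selection of extreme scorers within age groups can be sketched as follows. This is a minimal illustration only: the synthetic scores, the field names, and the assumption that a higher composite score means greater rigidity are hypothetical and are not taken from the TBR scoring manual.

```python
import random
from collections import defaultdict


def select_extremes(subjects, k=10):
    """Return the k highest and k lowest composite scorers in each age group."""
    by_group = defaultdict(list)
    for s in subjects:
        by_group[s["age_group"]].append(s)

    selected = {}
    for group, members in by_group.items():
        # Rank-order within the age group only, so that age is held constant.
        # Assumption: a higher score indicates greater rigidity.
        ranked = sorted(members, key=lambda s: s["rigidity"], reverse=True)
        selected[group] = {"rigid": ranked[:k], "flexible": ranked[-k:]}
    return selected


# Synthetic data: ten five-year age groups of 50 subjects each (500 in all).
random.seed(0)
subjects = [
    {"id": g * 50 + i, "age_group": g, "rigidity": random.gauss(50, 10)}
    for g in range(10)
    for i in range(50)
]

extremes = select_extremes(subjects)
n_rigid = sum(len(v["rigid"]) for v in extremes.values())
n_flexible = sum(len(v["flexible"]) for v in extremes.values())
print(n_rigid, n_flexible)  # 100 100
```

Ranking within each age group, rather than over the whole sample, keeps the two extreme groups identical in age composition.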


RESULTS 

Means for the extreme scorers were computed for measures of family status: age 
at first marriage, age the first child was born, and number of children. Measures of 
economic status included years of education, income level and occupational level. 
Income level was assessed on a six-point scale with $2000 intervals ranging from be- 
low $2000 to above $10,000. Occupational level was assessed using the major head- 
ings of the United States Census on a 10-point scale ranging from unskilled (0) to 
professional (9). Measures of mobility included frequency of change of residence, 
frequency of change of employment, and frequency of change of job classification. 
Self-ratings of happiness and success used a 5-point scale ranging from very happy or 
very successful (a score of 1) to very unhappy or very unsuccessful (a score of 5). A 
shortened version of the Social Responsibility scale constructed by Gough, Mc- 
Closkey and Meehl®? was also used. Table 1 gives the means for the rigid and flex- 
ible groups on all these variables. 


TABLE 1. MEANS FOR THE PERSONAL CHARACTERISTICS OF THE EXTREME “RIGID” AND
“FLEXIBLE” GROUPS





Variables                          Rigid      Flexible      Difference
                                       (N = 100)

Family status:
  Age at marriage
  Age at birth of first child
  Number of children
Economic status:
  Years of education
  Occupational level
  Income level
Mobility:
  Change of quarters
  Change of employment
  Change of type of work
Self-ratings:
  Happiness
  Success
  Social responsibility

[The numerical entries of this table are illegible in the source.]

*Since some of our subjects were unmarried, the N for the measures of family status are as 
follows: Age at marriage, rigid = 76, flexible = 70; age at birth of first child, rigid = 90, flexible = 85; 
number of children, rigid = 90, flexible = 90. 

**Significant at or beyond the 5% level of confidence. 


The analysis of variance was next used to test the null hypothesis for differences 
between the two groups on each variable and a factorial design, using methods sug- 
gested by Edwards, was employed which allowed for tests of the significance of 
differences between age groups, between flexible and rigid sub-groups, and of the 
interaction between age and rigidity-flexibility. Table 2 summarizes the results of 
these analyses. The F ratios are evaluated with one and 190 degrees of freedom for 
the difference between rigid and flexible groups and with nine and 190 degrees of 
freedom for the differences between age groups and for the interaction.* No F ratio 
is given where the test would be trivial. 


*The F ratios on the measures of family status were evaluated with the reduced sample sizes as 
indicated in the footnote to Table 1. 
TABLE 2. THE ANALYSIS OF VARIANCE TESTING THE DIFFERENCES IN GENERAL CHARACTERISTICS
BETWEEN RIGID AND FLEXIBLE INDIVIDUALS AND BETWEEN AGE GROUPS


                          Rigidity-
                         flexibility      Age      Interaction   Indiv.
Variable                  MS    F       MS    F      MS    F     Diff.

Family status:
  Age at marriage 47.02 2.46 1.34 19.12
  Age at birth of first child 83.19 29.25
  Number of children 1.25 1.28
Economic status:
  Years of education 796.01 1.26
  Income level 32.00 1.68
  Occupational level 239.81
Mobility:
  Change of residence 12.50
  Change of employment 1.45
  Change of occupation 0.08
Self-ratings:
  Happiness 4.80 6.96**
  Success 5.45 5.50*
  Social responsibility 486.72 27.30**

*Significant at the 5% level
**Significant at the 1% level


An examination of the mean scores for the rigid and flexible individuals on those 
variables where significant differences were shown in the analysis of variance discloses
that the flexible individuals exceed the rigid ones in the positive direction in all
cases. The extremely flexible members of the sample have had on the average four 
more years of education than the rigid. The mean income of the flexible Ss is 0.8 
scale points or approximately $1600 higher. The mean occupational level of the flex- 
ible Ss exceeds that of the rigid by two scale intervals, indicating that our average 
flexible S was occupied at the managerial or proprietary level, while the average rigid 
S belonged to the skilled labor group. The flexible members of the sample also des- 
cribed themselves as more successful and happier, and they scored higher on the 
scale of social responsibility, suggesting that they were more conscious of, and more
inclined to subscribe to, community-maintaining values.
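The dollar equivalent quoted above follows directly from the $2000 width of each step on the income scale. A small sketch of the conversion, with a hypothetical function name:

```python
INCOME_INTERVAL_DOLLARS = 2000  # width of one step on the six-point income scale


def scale_points_to_dollars(points, interval=INCOME_INTERVAL_DOLLARS):
    """Convert a difference in income-scale points to an approximate dollar amount."""
    return points * interval


# A mean difference of 0.8 scale points between the flexible and rigid groups.
print(round(scale_points_to_dollars(0.8)))  # 1600
```

The conversion is only approximate, since the lowest and highest scale steps are open-ended (below $2000 and above $10,000).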

No significant differences are noted on the measures of mobility, except fre- 
quency of change of quarters. Here it is interesting to note a significant interaction 
between rigidity-flexibility and age. The younger flexible Ss exceed the rigid ones on 
residential mobility but the older rigid Ss exceed the flexible ones. One might specu- 
late that, while in youth frequent changes of residence may be a function of taking 
advantage of changing opportunities, it may in the older group reflect difficulty in 
adjusting to environmental changes, there being a greater tendency for the rigid to 
move whenever the characteristics of their residential area change. Another explanation
for this reversal might be found in the fact that the lower income level of
the rigid may force change in living quarters. 

There was some tendency, although statistically not significant, for flexible 
people to marry later and to be older at the birth of their first child than the rigid 
members of the sample. The analysis also confirms significant age changes on all 
personal characteristics except for the self-ratings of happiness and success and the 
measure of social responsibility. 


SUMMARY 


Five hundred subjects, half men and half women, ranging in age from 20 to 70 years, were
divided into ten five-year age intervals and rank-ordered in terms of a composite 
rigidity score. The ten most flexible and ten most rigid individuals in each group
were examined with respect to certain of their personal characteristics. Results of an 
analysis of variance gave significant differences in favor of the flexible group on the 
measures of: years of education, income level, occupational level, self-ratings of hap- 
piness, success and social responsibility. Significant differences and interaction were 
found between rigidity-flexibility and age on a measure of mobility (change of resi- 
dence) with the young flexible and the old rigid being the more mobile groups. 
FURTHER STUDIES ON THE BENDER-GESTALT TEST AND THE
DIGIT-SPAN TEST AS MEASURES OF RECALL¹


ALEXANDER TOLOR 
USAF Hospital, Parks Air Force Base, California 


PROBLEM 

In a previous study? significant group differences in recall on the Bender- 
Gestalt Test and the Digit-Span test were found when patients suffering from organic 
brain injury were compared with patients having a variety of psychogenic disorders. 
It was observed further that these differences tend to be more pronounced on the 
Bender-Gestalt Test than on the Digit-Span test. The present investigation is a 
replication study which has as its main purpose the determination of whether the 
previously noted differences are maintained when a better controlled experimental 
design and new samples of functional and organic patients are used. 

In the present investigation two different groups of functional disorders were 
selected, i.e., a group containing only patients diagnosed as having schizophrenic 
reactions and a group consisting of patients having only character and behavior dis- 
orders. The performance of these diagnostically homogeneous samples was then 
compared with that of a sample of patients having some type of organic brain syn- 
drome. The use of these two markedly different groups of functional disorders also 
makes it possible to evaluate the effects of severity of emotional disturbance on 
Bender and digit recall. 


PROCEDURE 
All patients participating in this study had been referred for psychological test- 
ing at Parks Air Force Base Hospital. The 18 patients comprising the character and 
behavior disorder group were all males. Included were 8 individuals diagnosed as 
having schizoid personalities, 2 with paranoid personalities, 1 antisocial personality, 


¹The opinions and conclusions do not necessarily represent those of the Department of the Air
Force.
1 inadequate personality, 4 emotional instability reactions, 1 passive-aggressive
reaction, and 1 passive dependency reaction. 

Among the 18 psychotic patients there were 16 males and 2 females. The follow- 
ing diagnoses were represented in the group: 8 schizophrenic reactions, paranoid type; 
2 schizophrenic reactions, latent type; and 8 schizophrenic reactions, not elsewhere 
classified. The diagnosis of schizophrenia was not generally an obvious one since 
ordinarily only the more complex cases constituting a diagnostic challenge were 
seen on referral. 

All the patients in the organic group were male patients. The most frequent 
single diagnosis found here was that of encephalopathy due to trauma of which 
there were 8 cases. The other diagnoses covered a variety of organic disorders with 
demonstrable physical etiology or associated structural changes in the brain. In this 
category the most obvious cases of organicity were not generally seen but more fre- 
quently only those patients were encountered who represented a diagnostic problem 
at the beginning. It is precisely these less obvious cases who offer the clinician the 
opportunity of making his greatest contribution for diagnostic differentiation. 

Since in the earlier study it had been demonstrated that both measures of recall 
are substantially correlated with intelligence, the present experimental design called 
for the matching for intelligence of each pair of subjects so that a strict controlling 
for this variable was accomplished. The subjects were also matched on the age 
variable as far as possible. Table 1 presents the essential characteristics of the three 


TABLE 1. COMPOSITION OF THE THREE GROUPS

                                        Groups
                      Character & Behavior
Item                        Disorder          Schizophrenic     Organic

N                              18                  18              18
Age
  Range                      19-40               19-41           18-40
  Mean                        25.4                25.8            28.0
WAIS Full Scale IQ
  Range                     77-124              78-123          77-125
  Mean                        99.5                99.4            98.8

groups of patients. The success of efforts to equate the groups is indicated by the 
fact that none of the group mean differences on these variables approached signifi- 
cance. It should be noted that the groups were formed without prior knowledge on 
the part of the examiner of the patients’ performance on tests of recall. 

The procedure utilized in obtaining Bender and digit recall scores was identical 
to that used in collecting data in the previous study“. Essentially this involved the 
individual testing of all subjects who were routinely referred for psychological testing. 
After the Bender-Gestalt Test was administered in a standard way, each patient 
was instructed to reproduce from memory as many designs as he could. In addition, 
patients were also requested to reproduce any part of the designs that they could 
recall. The total number of whole and part, correct and incorrect designs drawn con- 
stituted the overall Bender recall score. 

Instead of using the Wechsler-Bellevue Scale, Form 1, as had been done former- 
ly, in the present experiment the Wechsler Adult Intelligence Scale was administered. 
Inasmuch as the Digit-Span test is identical on the two scales, there was no differ- 
ence in the manner in which the four digit recall scores, namely, total digits, digits 
forward, digits backward, and difference between the Digit-Span weighted score and 
the patient’s average WAIS score, were obtained. 


RESULTS 


The major findings are contained in Table 2 presenting a comparison of the 
three groups of patients on five selected recall measures. 
TABLE 2. SUMMARY OF BENDER-GESTALT TEST AND DIGIT-SPAN TEST RECALL SCORES FOR THREE
DIFFERENT GROUPS (N = 18) OF PATIENTS

                       Character & Behavior    Organic Brain    Schizophrenic
                             Disorder             Disease         Reactions
Recall Measure             Mean      SD         Mean     SD          Mean

Bender Figures              6.3                                       5.8
Total Digits               11.0
Digits Forward              6.4
Digits Backward             4.6
Difference Between
  Digit-Span and
  Mean WAIS Score

[The remaining entries of this table are illegible in the source.]





Table 3 shows the level of significance attained when critical ratios are com- 
puted in comparing each group’s performance on the five measures of recall with 
every other group. Since the groups were matched for intelligence, the following r’s
were introduced into the formula for the standard error of the difference between 
means: .50 between intelligence and Bender recall, .70 between intelligence and 
total digit recall, .60 between intelligence and digits recalled forward, .65 between 
intelligence and digits recalled backward, and .70 between intelligence and the 
difference between the Digit-Span test and the mean WAIS score. These correlations 
were taken from the previously performed study. 
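The text does not print the formula used. One textbook form of the standard error of the difference between means for groups matched on a correlated variable reduces the usual error variance by the factor (1 - r²), where r is the correlation between the matching variable and the measure compared. The sketch below assumes that form; the function name and the group statistics in the example are hypothetical and are not values from Table 2.

```python
import math


def critical_ratio_matched(m1, sd1, n1, m2, sd2, n2, r):
    """Critical ratio for two groups matched on a variable correlating r
    with the measure being compared (assumed textbook form)."""
    se1_sq = sd1 ** 2 / n1          # squared standard error of the first mean
    se2_sq = sd2 ** 2 / n2          # squared standard error of the second mean
    se_diff = math.sqrt((se1_sq + se2_sq) * (1 - r ** 2))
    return (m1 - m2) / se_diff


# Hypothetical means and SDs; r = .50 is the intelligence-Bender recall value cited above.
cr = critical_ratio_matched(6.3, 1.5, 18, 4.8, 1.5, 18, 0.50)
print(round(cr, 2))
```

With r = 0 the expression reduces to the ordinary critical ratio for independent groups, so a larger correlation with the matching variable yields a smaller standard error and a larger ratio.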


TABLE 3. CRITICAL RATIOS AND LEVELS OF SIGNIFICANCE IN COMPARING THREE
GROUPS OF PATIENTS ON SEVERAL MEASURES OF RECALL


                      Character & Behavior    Schizophrenics    Character & Behavior
Measure of                    vs.                   vs.                  vs.
Recall                     Organics              Organics          Schizophrenics

Bender Figures              3.26**                2.05*
Total Digits                3.06**                4.14**
Digits Forward              3.33**                4.66**
Digits Backward             1.30                  4.03**
Difference Between          4.13**                4.04**
  Digit-Span and Mean WAIS


*Significant at the .05 level or better
**Significant at the .01 level or better











It will be observed from Table 3 that with the exception of number of digits 
recalled backward, all of the recall measures selected reveal a significantly superior 
performance of the patients belonging to the character and behavior pattern group 
as compared to the organic group. In comparing the organic with the schizophrenic 
patients, all of the recall scores for the schizophrenics exceed to a significant degree 
the recall scores obtained by the patients with organic brain injuries. On the other 
hand, with the exception of the significantly greater recall of digits backward for 
schizophrenic as compared to character and behavior disorder patients, none of the 
differences between these two psychogenic groups are significant. 

Further comparisons were made between groups using the Median Test. In 
applying this method to test the hypothesis that the population of organic patients 
has a smaller median in total number of Bender figures recalled than the population 
of character and behavior disorder patients, confirmation is found for this hypothesis. 
The chi square of 2.76 is significant at the .05 level using a one-tailed test. A similar 
test with respect to total number of digits recalled indicates that the organic patients 
tend to have a smaller median than patients with character and behavior disorders 
(chi square of 2.76 is significant at the .05 level using a one-tailed test). The Median 
Test failed to support the hypothesis that a greater number of organic cases will fall 
below the combined median for organic and schizophrenic patients with reference to 
total number of Bender designs recalled. However, support was found for the hy- 
pothesis that a greater number of organic patients will fall beneath the combined 
median for the two groups in total number of digits recalled. The chi square of
4.08 yielded a P of .025 using a one-sided test of significance. In each of these non- 
parametric analyses the Yates’ correction was applied. 
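The Median Test with Yates’ correction can be sketched as below. The recall scores are hypothetical, chosen only to show the computation; the function names are illustrative. For one degree of freedom the two-tailed probability of chi square equals erfc(sqrt(chi square / 2)), and the one-tailed value is taken here as half of that.

```python
import math
from statistics import median


def median_split(group_a, group_b):
    """Count scores above / not above the combined median in each group."""
    cut = median(group_a + group_b)
    a_above = sum(v > cut for v in group_a)
    b_above = sum(v > cut for v in group_b)
    return a_above, len(group_a) - a_above, b_above, len(group_b) - b_above


def yates_chi_square(a, b, c, d):
    """Chi square for a 2 x 2 table [[a, b], [c, d]] with Yates' correction."""
    n = a + b + c + d
    corrected = max(abs(a * d - b * c) - n / 2, 0)
    return n * corrected ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


def one_tailed_p(chi_square):
    """One-tailed probability for chi square with one degree of freedom."""
    return math.erfc(math.sqrt(chi_square / 2)) / 2


# Hypothetical recall scores for two small groups.
character_behavior = [8, 7, 7, 6, 6, 5]
organic = [6, 5, 4, 4, 3, 3]

table = median_split(character_behavior, organic)
chi_sq = yates_chi_square(*table)
print(table, chi_sq, round(one_tailed_p(chi_sq), 3))
```

Applied to the values reported in the text, `one_tailed_p(2.76)` falls just under .05 and `one_tailed_p(4.08)` falls under .025, in line with the levels stated.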

The frequency with which a mixed group of psychogenic patients and a sample 
of organic patients recall the various Bender designs has been reported by Tolor @).
An attempt is made here to report similar data on more homogeneous groups in order 
to determine the stability of differences in the frequency of design recall. Table 4 
presents a breakdown of the proportion of each group recalling each one of the nine 
Bender designs as well as the rank order of recall. 

Although the three groups do differ somewhat in the ease with which patients 
recall the various designs, in general Designs 3 and 4 appear to be among the most 
difficult whereas Figures 8 and 6 are usually found among the easiest. These results 
are similar to those obtained in the above-mentioned study as well as those reported 
by Reznikoff and Olin®?. In contrast to the latter investigation, however, the present
experiment shows Design 7 to be one of the easiest for recall. 

Further evidence for the high degree of similarity in the Bender recall pattern is 
derived from the high rank-difference correlations between groups. The highest cor- 
relation (.90) is found between the organic and character and behavior disorder 
groups whereas the lowest (.75) is obtained between the schizophrenic and organic 
groups. The correlation between schizophrenic and character-behavior disorder
recall is .85. 
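The rank-difference correlation between two groups' orders of recall for the nine designs can be computed as in the sketch below. The two rank orders are hypothetical, since the body of Table 4 is not available here, and the formula assumes no tied ranks.

```python
def rank_difference_correlation(ranks_x, ranks_y):
    """Rank-difference (Spearman) correlation for untied ranks:
    rho = 1 - 6 * sum(d^2) / (n * (n^2 - 1))."""
    n = len(ranks_x)
    sum_d_sq = sum((x - y) ** 2 for x, y in zip(ranks_x, ranks_y))
    return 1 - 6 * sum_d_sq / (n * (n ** 2 - 1))


# Hypothetical rank orders of recall for the nine Bender designs in two groups.
group_one = [1, 2, 3, 4, 5, 6, 7, 8, 9]
group_two = [2, 1, 3, 5, 4, 6, 7, 9, 8]

rho = rank_difference_correlation(group_one, group_two)
print(round(rho, 2))  # 0.95
```

With only nine designs, a few exchanges of adjacent ranks are enough to move the coefficient by several hundredths, which is worth bearing in mind when comparing values such as .90 and .75.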


DISCUSSION


The results of this study provide further support for the hypothesis that there 
are significant differences in immediate recall, as measured by the Bender-Gestalt 
Test and the Digit-Span test, between organic and psychogenic patients. As was 
noted previously, these group differences are certainly insufficient in magnitude to 
be of diagnostic value for individual cases; however, they do consistently demon- 
strate that memory impairment tends to be far more severe if secondary to an organic 
brain injury than if it is a consequence of a functional disorder. These results are 
even more impressive when one considers the strict matching of subjects on the intell- 
igence variable and the close matching for age, both of which might otherwise have 
resulted in obscure or inconclusive results. 

In addition, the results indicate that with the exception of one measure of recall, 
namely, the number of digits repeated backward, the severity of the functional dis- 
order has no effect whatsoever on the basic finding that the immediate memory of 
psychogenic patients tends to be superior to that of organic patients. Thus, patients 
who have developed some type of character defect resemble much more closely in 
their ability to recall test material patients who are psychotic than they do individ- 
uals with some kind of organic brain syndrome. Reznikoff and Olin®? also noted the 
superior performance of schizophrenic patients as compared to organics, but only on 
one of their three measures of Bender recall. 


TABLE 4. PROPORTION OF SUBJECTS IN THE THREE GROUPS RECALLING EACH OF THE NINE BENDER
DESIGNS AND RANK ORDER OF RECALL


       Organics             Schizophrenics       Character & Behavior
                                                      Disorders
  Proportion   Rank       Proportion   Rank       Proportion   Rank
Inasmuch as the character and behavior pattern group did not differ significantly
from the organic group in number of digits recalled backward, whereas both
psychogenic groups differed from the organic group in Bender figures recalled, there 
is partial but inconclusive support in this study for the impression that the Bender 
recall is a better measure for discrimination than the Digit-Span test. The poor 
performance of the character and behavior disorder group in recalling digits back- 
ward might be a function of these patients’ relative intolerance to strong situational 
stress. This stress, it would be reasonable to assume, is greater for recalling digits 
backward than for repeating them forward. 


SUMMARY


Three groups of patients, a schizophrenic group, a character and behavior dis- 
order group, and an organic brain injury group were compared for their performance 
on several measures of recall which had been derived from the Bender-Gestalt Test 
and the Digit-Span test. All patients had been matched for intelligence and age. 
The results were in essential agreement with previous findings in demonstrating that 
the performance of psychogenic patients tends to be superior to that of organic 
patients on both tests. It was shown further that the severity of the functional dis- 
order bears little relationship to the differences between organic and non-organic 
groups and that the order of difficulty in recalling the nine Bender designs tends to 
be similar for all samples tested. 
AGE, INTELLIGENCE, APHASIA AND THE SPIRAL AFTER-EFFECT
IN AN EPILEPTIC POPULATION 


BERNARD S. AARONSON
New Castle State Hospital, New Castle, Indiana 


PROBLEM 


The ability to perceive the spiral aftereffect illusion has been proposed by 
Price and Deabler“ as a means of distinguishing between individuals with and 
without brain damage. A number of other investigators“: *: ©) have shown that the 
procedure has diagnostic utility, although most have not reported the striking re- 
sults noted in the original article. The present study is an attempt to examine this 
phenomenon as it appears within the context of an epileptic population with a high
incidence of mental deficiency. By examining the relationship between performance 
on this task and such variables as age, intelligence, severity of anomia, extent of dis- 
order in sensory perception, and laterality of sensory disorders, an attempt is made 
to isolate some of the factors entering into performance on this task. 


PROCEDURES 


The spiral aftereffect was administered to 65 consecutive patients seen in the 
psychology department for any diagnostic purpose. Patients not in sufficient contact
to take the test were excluded. The sample consisted of 35 males and 30 females
who ranged in age from 8 to 70. The mean age was 31.3 with an S.D. of 17.1 years.

The stimulus material consisted of two six-inch white cardboard disks with a 
920° Archimedes spiral painted on each with black paint. The spirals pointed in op- 
posite directions so that they gave opposite effects of rotation when placed on a 78 
rpm phonograph turntable. The method of administration and scoring followed 
Gallese“? and minimized the effect of verbal factors. Each stimulus card was pre- 
sented twice in rotating order and a point was scored every time the aftereffect was 
correctly reported. An aftereffect of contraction was considered “correct” for the 
stimulus which seemed to expand during presentation, and one of expansion was con- 
sidered ‘“‘correct”’ for the stimulus which seemed to contract. The range of possible 
scores was from 0 to 4. 

The distribution of scores was obtained and the percentage of cases obtaining 
scores of two or less compared with that reported by Gallese. For the remainder of 
the comparisons, the sample was dichotomized between those perceiving the proper 
aftereffect on all presentations and those failing on any presentation. This was done 
as it was felt that in this population where organicity is the presumption, the use of a 
lower cutting score would tend to obscure some of the true variance. 
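The scoring rule and the dichotomy described above can be sketched as follows. The trial records and the labels used for the reports are hypothetical; only the rule (one point per correctly reported aftereffect over four presentations, and a split between perfect and imperfect performance) comes from the text.

```python
# The correct aftereffect is the opposite of the apparent motion during presentation.
CORRECT_AFTEREFFECT = {"expand": "contract", "contract": "expand"}


def spiral_score(trials):
    """Number of presentations on which the correct aftereffect was reported (0 to 4)."""
    return sum(
        1 for apparent, reported in trials
        if CORRECT_AFTEREFFECT[apparent] == reported
    )


def perceives_on_all(trials):
    """True only if the aftereffect was correctly reported on every presentation."""
    return spiral_score(trials) == len(trials)


# Hypothetical record: (apparent motion during presentation, patient's report).
record = [
    ("expand", "contract"),
    ("contract", "expand"),
    ("expand", "none"),
    ("contract", "expand"),
]
print(spiral_score(record), perceives_on_all(record))  # 3 False
```

Under this dichotomy a patient with a score of three is grouped with those scoring zero, which is the stricter cutting point the text adopts for a population in which organicity is presumed.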

The sample was divided into three equal groups on the basis of age, and the 
interaction of age and spiral performance tested by chi square. As a measure of the 
relationship between these two variables when viewed against a normal surface, a 
biserial correlation was computed. 
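A biserial correlation between a dichotomized variable (here, perfect versus imperfect spiral performance) and a continuous one can be computed from the two group means, the total standard deviation, the proportions in each category, and the ordinate of the normal curve at the point of dichotomy. The sketch assumes the usual textbook formula; the ages shown are hypothetical.

```python
from statistics import NormalDist, mean, pstdev


def biserial_r(scores_pass, scores_fail):
    """Biserial r = (M_p - M_q) / SD_total * (p * q / y), where y is the
    ordinate of the unit normal curve at the point cutting off proportion p."""
    n_p, n_q = len(scores_pass), len(scores_fail)
    p = n_p / (n_p + n_q)
    q = 1 - p
    unit_normal = NormalDist()
    y = unit_normal.pdf(unit_normal.inv_cdf(p))
    sd_total = pstdev(scores_pass + scores_fail)  # SD of the whole sample
    return (mean(scores_pass) - mean(scores_fail)) / sd_total * (p * q / y)


# Hypothetical ages of patients with perfect and imperfect spiral performance.
ages_perfect = [20, 25, 30]
ages_imperfect = [30, 35, 40]
r_bis = biserial_r(ages_perfect, ages_imperfect)
print(round(r_bis, 2))
```

A negative value indicates that the group with perfect performance has the lower mean on the continuous variable. Both categories must be non-empty, since the normal ordinate is undefined for p equal to 0 or 1.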

The Wechsler-Bellevue Intelligence Scale, Form I was administered to 40 
patients in this sample. This group consisted of 22 males and 18 females with a mean 
age of 32.6 and an S.D. of 15.2 years. Their verbal, performance, and full scale IQs 
were compared with their spiral performance by chi square and biserial correlation. 
The chi square values were corrected by Yates’ correction for continuity. 

An aphasia screening test devised by Reitan “) was administered to 50 patients 
in this sample. This group consisted of 27 males and 23 females, with a mean age of 
30.4 and an S.D. of 17.4 years. Performance on the spiral was compared by chi
square, which was corrected for continuity, with the ability to identify orally the 
drawings of three ‘“‘concrete’”’ and three “‘abstract’’ objects. The concrete objects 
were a baby, a fork, and a key. The abstract objects were a square, a cross, and a
triangle. 

This test also includes a set of procedures to test for impairment of visual, 
auditory, and tactual receptive function. These procedures measure failures of per- 
ception of stimulation through these modalities and suppression of stimulation to 
one side when the modality is stimulated bilaterally. Forty-seven patients produced
usable records with regard to all of these modalities. Some patients had to be elim- 
inated because of confusion as to which side was being stimulated in a given sensory 
modality and some because of chronic conditions which made it difficult to test in 
that area. These included chronic deafness, very poor visual acuity, and the absence 
of hands. Performance on the spiral was compared by chi square with the number of 
different sensory areas in which impairment was noted. Since the absolute number 
of those showing impairment in any given sensory avenue was too small to meet the 
assumptions of chi square, the hypothesis that impairment would go along with 
lowered spiral performance was tested by Yule’s coefficient of association, Q“.
Finally, the 22 cases showing a preponderance of right-sided sensory involvements 
were compared with regard to their spiral performance by chi square, which was cor- 
rected for continuity, and by Q with the 12 who showed left-sided involvements. 
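Yule’s coefficient of association for a fourfold table is Q = (ad - bc) / (ad + bc). A minimal sketch follows; the cell counts are hypothetical and are not the frequencies underlying the values reported in this study.

```python
def yules_q(a, b, c, d):
    """Yule's Q for the 2 x 2 table [[a, b], [c, d]]."""
    cross_sum = a * d + b * c
    if cross_sum == 0:
        raise ValueError("Q is undefined when both cross-products are zero")
    return (a * d - b * c) / cross_sum


# Hypothetical fourfold table:
#                  spiral failed   spiral passed
#   impaired             8               3
#   not impaired        10              27
q = yules_q(8, 3, 10, 27)
print(round(q, 2))  # 0.76
```

Unlike chi square, Q does not depend on expected cell frequencies, which is why it can be used when few patients show impairment in a given sensory avenue; it reaches 1 or -1 whenever any one cell is empty.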


RESULTS 
As shown in Table 1, the distribution of spiral scores tends to be U-shaped. In 
the present sample, 44% obtained scores of two or less, which is a greater number 
than that reported by Gallese“? for his mixed epileptic and alcoholic groups and 
comparable with the results reported with the remainder of his organics. 
TABLE 1. DISTRIBUTION OF SPIRAL SCORES IN THE
SAMPLE AND THE PROPORTION OBTAINING EACH SCORE





Score N 








When performance on the spiral is compared with age, no relationships are 
noted. A chi square of 0.86 is obtained, which is not significant. Inspection of the 
actual scores, however, does suggest a trend for older individuals to obtain poorer 
scores. A biserial correlation for these data is -.21, which is not significant. 

Comparison of spiral performance with Wechsler scores yields significant asso- 
ciations with verbal, performance and full scale IQs. The chi square values for spiral 
performance with these scores are 6.40, 10.00, and 6.40 respectively, which are sig- 
nificant at the .02 and .01 levels. Biserial correlations calculated for these same data 
are .55, .66, and .63 respectively, of which the first is significant at the .01 level and 
the latter two at the .001 level. 

Only four patients in the sample failed to name all three drawings of concrete 
objects and none of these reported the aftereffect correctly on all four exposures. In 
comparing spiral performance with ability to name geometrical shapes, one error in 
naming was allowed because of the low educational level of the group. The distribution
of naming scores was dichotomized between one and two errors. The chi square
obtained in this comparison is 10.15, significant at beyond the .01 level. 

When spiral performance is compared with the number of sensory modalities in 
which disorders of perception occur, a chi square of 3.81 is obtained, significant at the 
.06 level. The coefficients of association between spiral performance and specific 
sensory impairments are shown in Table 2. Poor spiral performance seems highly 
associated with impairment of visual function and to a lesser extent with auditory 
impairment. 

TABLE 2. COEFFICIENTS OF ASSOCIATION BETWEEN
SENSORY IMPAIRMENT AND SPIRAL AFTEREFFECT FAILURES


                                  N
Area of Impairment      Impaired    Nonimpaired       Q

Auditory                   11           37           .48*
Visual                     11           38
Tactual                    21           28           .30


*p is at least .05
**p is at least .001


Comparison of the relation of laterality of sensory impairment with spiral per- 
formance yields a chi square of 1.16, which is not significant. Since a trend for lower 
scores to be associated with right-sided sensory impairment was noted, Yule’s coefficient
of association was calculated and a Q value of .49 obtained, significant at the
.08 level. 

The data support the validity of the spiral aftereffect as a measure of brain
damage. The agreement noted with Gallese’s results“), the tendency for performance
to drop with IQ, anomia, and extent of sensory impairment all suggest that
performance on this task tends to decline as brain damage becomes more severe. 

While the greater association with the Performance IQ on the Wechsler might 
tend to suggest that the test is tapping a right-sided brain function“, the strong 
association with anomia tends to implicate the left hemisphere. This last seems to 
be buttressed by the trend for individuals with right-sided sensory impairment to
get poorer spiral scores. If the pattern of association between the spiral and visual, 
auditory, and tactual impairment is taken into account, the data suggest a primary 
involvement of the visual pathways in the temporal area. An involvement impinging 
on the visual pathways in this region would also tend to interrupt auditory fibers. 
Involvements posterior to this might show more relation with tactual impairment 
and less with auditory. 

The association of lowered spiral performance with anomia raises the question 
as to what extent impaired performance is a function of being unable to perceive the 
spiral aftereffect and to what extent it is a function of being unable to express verbal- 
ly changes in an ambiguous stimulus. Goldstein and Scheerer®? have previously 
pointed out the difficulty that organics have in conceptualizing ambiguous stimuli. 
Insofar as left temporal lobe involvement tends to be associated with impairment of 
language function and in these data there is a suggestion of an association between 
both temporal lobe involvement and left hemisphere involvement and spiral per-
formance, the extent to which performance on this task is a function of verbal factors
requires further evaluation.


SUMMARY 


The spiral aftereffect was administered to 65 epileptic patients of whom 40 had 
also taken the Wechsler-Bellevue, Form I, and 50 had taken an aphasia screening 
test. The percentage of individuals making two or more errors agreed with that 
found by Gallese for organics. No relation to aging was found. Poorer spiral per- 
formance seemed associated with lower Wechsler scores, anomia, the number of 
different sensory avenues impaired, visual and auditory impairment, and right-sided 
sensory impairment. The data suggest that the test is most sensitive to temporal 
lobe involvement and may be most sensitive to left cerebral hemisphere involve- 
ment. The question is raised as to whether lowered performance on the spiral arises 
from inability to perceive the aftereffect or inability to verbalize changes in an ambig- 
uous stimulus. 

WECHSLER BELLEVUE CHANGES OVER TIME
STEWART G. ARMITAGE AND DAVID PEARL 
Veterans Administration Hospital, Battle Creek, Michigan 


PROBLEM 


Numerous unsuccessful attempts have been made to determine cognitive pat- 
ternings on the Wechsler Bellevue Intelligence scale which could be utilized in the 
differential diagnosis of psychopathology. Many clinicians, however, continue to 
envisage varying aspects of intellectual functioning as being selectively affected by 
differing psychopathological processes. One such distinction is often made between 
the effects of schizophrenic processes and non-psychotic (non-brain damaged) mal- 
adjustments on cognition. While maladjusted non-psychotics may demonstrate in- 
efficiencies in intellectual functioning, their performance is assumed to remain more 
stable than is that of persons undergoing a schizophrenic process. This concept of 
deficit in schizophrenia as an ongoing process can be detected in the distinction often 
made between the early and the more chronic schizophrenic patient. Some have held 
that in early schizophrenia, obvious intellectual impairment is not apparent, subtle 
qualitative evaluations of test performance being necessary to detect beginning 
thought deficits, whereas intellectual loss is more easily seen in the more chronic 
schizophrenic patient. A further distinction often is made between schizophrenic 
categories. Thus, the paranoid schizophrenic patient supposedly undergoes less loss 
than the other types. 

The purpose of this study was to investigate the concept of process as applied to 
intellectual functioning in the instance of schizophrenic and non-schizophrenic, 
non-brain damaged subjects using the Wechsler-Bellevue Intelligence Scale, Form I, 
which was given to a group of 162 hospitalized schizophrenic and non-psychotic 
(non-brain injured) subjects upon their initial and again upon a following hospital- 
ization. It was proposed to determine the stability of intellectual performance over 
time through the correlation of initial and subsequent Wechsler-Bellevue perform- 
ance, and to evaluate the following two hypotheses: (a) that greater intellectual 
deficit will occur over time in schizophrenic subjects than in non-psychotic patients 
and (b) that paranoid schizophrenics undergo less intellectual deficit over time than 
other schizophrenic subjects, in this instance, those diagnosed as unclassified schizo- 
phrenics. 

PROCEDURE AND RESULTS 

As previously indicated, 162 rehospitalized neuropsychiatric patients diagnosed 
similarly on both hospital admissions as either paranoid or unclassified schizophren- 
ics or non-psychotic were retested with the Wechsler-Bellevue scale. Mean age for 
patients upon first hospitalization was 30.5 years (SD 7.5) and average length of 
time between hospitalizations was 1.5 years. Only the above subcategories of schizo- 
phrenia were utilized, since too few cases of the other types were encountered. Since 
none of these subjects had been hospitalized prior to their first admission, it may be 
assumed that they, particularly the schizophrenic subjects, were in the early or less 
chronic stages of their maladjustment at the time of initial hospitalization. 

Data bearing on the question of stability were obtained through linear correla-
tions of initial and readmission scores on Wechsler-Bellevue subtests and on verbal, per-
formance and full scale IQs, and are presented in Table 1. Although these cannot be
considered measures of reliability, since an average of 18 months elapsed between
tests and clinical change may have occurred, they are comparable to Wechsler-
Bellevue scale reliabilities found by other investigators (1, 2, 3, 4).

TABLE 1. WECHSLER-BELLEVUE INTELLIGENCE SCALE, FORM I, TEST-RETEST CORRELATIONS

In order to determine intellectual changes between hospitalizations, elapsed
time between test administrations was trichotomized on the basis of its distribution.
Since ageing of subjects may affect Wechsler-Bellevue changes, it was controlled by
considering test score changes for subjects of three age categories: 25 years or less,
26-32 years, and 33 years and over. The two hypotheses were evaluated
through analyses of variance by means of a triple classification factorial design
in which the variables of diagnostic category, subjects’ ages at initial testing and
elapsed time between successive test performances were each varied three ways.
Since the use of all 162 subjects did not provide equal or proportional representation
for all cells of the factorial design, it was necessary to set up a ratio of four unclassi-
fied schizophrenic to two each of paranoid schizophrenic and non-psychotic subjects
for each factorial cell involving age and test time elapsed. A total of 36 unclassified
schizophrenic and 18 each of the other two diagnoses were randomly selected from
the larger subject population for these analyses. The nature of the variance analyses
is illustrated in Table 2.


TABLE 2. ANALYSIS OF VARIANCE DESIGN FOR WECHSLER-
BELLEVUE SCALES AND SUBTESTS

Source                                  Degrees of Freedom

Diagnosis
Time Interval
Diagnosis x Time Interval
Diagnosis x Age
Time Interval x Age
Diagnosis x Time Interval x Age
Error
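
The degrees of freedom implied by the design described in the text can be worked out with a short sketch. This is an illustration under stated assumptions, not the authors' table: it assumes a fully crossed 3 x 3 x 3 layout (diagnosis, time interval, age) with the 72 selected subjects (36 + 18 + 18), and the function name `factorial_df` is hypothetical.

```python
from itertools import combinations
from math import prod


def factorial_df(levels, n_subjects):
    """Degrees of freedom for each source in a fully crossed factorial design."""
    df = {}
    names = list(levels)
    # Main effects (size 1) and every interaction (size 2 and up).
    for size in range(1, len(names) + 1):
        for combo in combinations(names, size):
            df[" x ".join(combo)] = prod(levels[name] - 1 for name in combo)
    n_cells = prod(levels.values())
    df["Error"] = n_subjects - n_cells
    df["Total"] = n_subjects - 1
    return df


# Assumption: three levels per factor, 72 subjects in all.
design = {"Diagnosis": 3, "Time Interval": 3, "Age": 3}
df_table = factorial_df(design, n_subjects=72)
for source, value in df_table.items():
    print(f"{source:35s}{value:4d}")
```

Under these assumptions each main effect carries 2 degrees of freedom, each two-way interaction 4, the three-way interaction 8, and error 45.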
In general, results of these analyses fail to show any of these variables or their
interactions as exercising a significant effect upon Wechsler-Bellevue score changes.
Only two variables were found to be significantly related to change: elapsed time for
Picture Arrangement and diagnosis for the Vocabulary subtest. In the first, sub-
jects retested within nine months obtained improved scores, whereas no changes oc-
curred for those with a longer interval, a result which may be attributable to
a practice effect. In the second, non-psychotic subjects’ vocabulary improved, while
no change occurred for the two schizophrenic groups. Two variable interactions were
significant, Age x Diagnosis in the Arithmetic subtest and Elapsed Time x Age in
the Digit Symbol subtest. Since 98 tests of significance were made and only four
significant relationships were found, these must be considered as probably due to
chance, so that neither of the study’s two hypotheses is supported by the data.
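
The chance argument can be made concrete with a rough calculation. The sketch below assumes a .05 significance level and independent tests; neither is stated in the paper, and the subtests are in fact correlated, so the figures are only indicative.

```python
from math import comb


def prob_at_least(k, n, p):
    """Binomial probability of k or more successes in n independent trials."""
    return 1.0 - sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k))


n_tests = 98   # number of significance tests reported in the text
alpha = 0.05   # ASSUMED significance level; not given in the paper

expected_by_chance = n_tests * alpha
p_four_or_more = prob_at_least(4, n_tests, alpha)

print(f"Expected significant results by chance: {expected_by_chance:.1f}")
print(f"P(4 or more by chance): {p_four_or_more:.2f}")
```

With about five significant results expected by chance alone, finding four is unremarkable, which is the point the authors make.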


DISCUSSION


Despite an average elapsed time of 18 months between tests, achievement on 
Information and Vocabulary subtests remained highly constant as did Verbal and 
Full Scale IQs and, to a lesser extent, Performance Scale IQs and Block Design sub-
test scores. Table 1 indicates that test-retest correlations of this study are generally
similar to those found by previous investigators. Considering the unreliability of 
many of the subtests, it is indeed surprising that there should be any substantial 
agreement. Such correspondence would suggest that any changes observed in test 
scores of the study’s subjects may be attributable to test unreliability rather than 
to any clinical changes occurring between tests. 


SUMMARY 


The stability of Wechsler-Bellevue Intelligence Scale, Form I, performance of 
162 schizophrenic and non-psychotic patients was determined by correlating initial 
test achievement with test performance on a subsequent hospitalization. Deficit 
occurring in unclassified, paranoid schizophrenics and non-psychotic patients in the 
interval between tests was investigated by means of a factorial design with diagnosis, 
subjects’ ages and intervening time between tests as variables. In general, test- 
retest correlations are similar to those found in reliability studies. No differences 
were found between the three diagnostic groups with respect to changes on various 
Wechsler-Bellevue measures. 

A COMPARISON OF THE BENDER GESTALT REPRODUCTIONS
OF DELINQUENTS AND NON-DELINQUENTS¹


EDWIN S. ZOLIK
Marquette University 


PROBLEM 


The Bender Gestalt (B-G) Test has been little used with juvenile delinquents, a 
group in which emotional maladjustments may be assumed to play a conspicuous 
role. Since the B-G assumes that the reproduction of visuo-motor gestalten is
influenced and distorted by intellectual impairment and emotional maladjustments
(1, 4, 8), we may hypothesize that when maturational and intellectual factors are held
constant, the test performance of juvenile delinquents and non-delinquents will 
differ significantly. 


¹This study is based upon a thesis submitted by the writer to the Graduate School of Arts and
Sciences of the Catholic University of America in partial fulfillment of the requirements for the M. A.
degree in psychology. The writer is indebted to Professor William D. Commins and Dr. Helen Peix-
otto for their valuable assistance. We also wish to thank Dr. Raymond McCall for his critical reading
of this manuscript.

METHOD


Forty-three adolescent delinquents were individually matched with a group of
forty-three non-delinquents residing in a high delinquency area on the factors of
age, Otis IQ, and absence of any type of motor defect. An initial attempt to equate
the groups on the variable of education did not prove feasible. The mean age of the
control group is 15.95 years (s.d. = 0.92), whereas the delinquent group has a mean
of 16.12 years (s.d. = 0.90). The mean Otis IQ for the control group is 105.28
(s.d. = 8.90) and for the delinquent group the mean is 101.65 (s.d. = 8.39). The
Pearson r between the ages of the two groups is .81 and between the IQ’s is .61, both
of which are significant at the .001 level.


RESULTS


An analysis of the B-G figures, comparing the control and delinquent groups, 
was made with respect to both the sum total score and the individual figure scores 
employing Pascal and Suttell’s®) method. Table 1 gives the critical ratios obtained 
and their significance. In order to ascertain which scored aspects of each figure con- 
tributed to these significant differences, the performances on each figure were further 
analyzed and critical ratios computed. 


TABLE 1. COMPARISON OF DELINQUENT AND NON-DELINQUENT SCORES ON THE BENDER-GESTALT
FIGURES (N = 43)

ae a al is significant (.01 level) on the three figures (Figs. 3, 5, 6) where
it is scored.

The variable “Dots, Dashes, Circles” is significant (.01 level) on two of the three
figures (Figs. 3, 5) where it is a scored deviation. From Pascal and Suttell’s results
(p. 48) it appears that this is indicative of the possibility of either a maturational
failure or of emotional components giving rise to regressive tendencies in our de- 
linquent group. 

“Tremor” is a significant indicator (Fig. 4: .05 level; Fig. 7: .001 level) on two 
of the four figures on which it is a scorable variable. This may be indicative of a 
higher degree of neuromuscular incoordination under conditions of tension and 
anxiety in the delinquent. 

“Second attempt”, a scorable deviation on all the figures, was significant (.05
level) on Fig. 4 only. Hutt“) considers the curvilinear part of Fig. 4 to have dynamic
implications concerning an individual’s relationship with female figures. Since the
“second attempt” occurred on this part of the figure in all the delinquent records, it
appears that further research should be directed at substantiating Hutt’s formula-
tion.

“Wavy line” is significant on Fig. 1 (.05 level). “Dashes” is significant on Fig. 1 
(.05 level). “Extra angles” is significant on Fig. 7 (.01 level).

Pearson r’s were computed between age and intelligence and the B-G scores for 
both the delinquent and non-delinquent groups. The correlation between IQ and the 
B-G score was -.13 for the delinquent group and -.16 for the non-delinquent group, 
neither correlation reaching significance. Between chronological age and B-G score
r is .04 for the delinquent group and .01 for the non-delinquents, neither of which is 
significant. The obtained coefficients are comparable to those reported by Pascal 
and Suttell® »- 2. 3), and it can be concluded that, after maturation, errors which 
appear in the reproduction of the B-G figures are not attributable to chronological 
age nor intelligence but to other psychological differences. Finally, a Pearson r of 
-.10 between the B-G scores of the matched pairs in both groups indicated that 
equating did not lead to a significant correlation. 

To determine the validity of the dichotomy between delinquents and non- 
delinquents, a tetrachoric r was computed according to the Cosine-pi formula 
(3, p. 886) using the median B-G z score as the point for dichotomizing the two groups. 
The obtained tetrachoric r of -.77 ± .093 is more than eight times its S. E. and highly signi-
ficant. The efficiency of this dichotomization was improved by curtailing Pascal
and Suttell’s questionable range from 50-72 to 50-60. An upper cut-off score one
S. D. below the mean of the delinquent group, or a z score of 60, excludes 95.4% of 
the control group. Of the delinquents, 20.9% would fall in the questionable range 
but the number of delinquents correctly classified would increase from 32.6% to 
69.8%. This increase in diagnostic prediction is significant at the .01 level. 
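
The cosine-pi approximation to the tetrachoric r mentioned above can be sketched as follows. The fourfold counts are hypothetical (the paper does not give the cell frequencies), and the sign of the result depends only on how the two dichotomies are coded.

```python
import math


def tetrachoric_cos_pi(a, b, c, d):
    """Cosine-pi approximation to the tetrachoric r for a 2x2 table [[a, b], [c, d]].

    r = cos(pi / (1 + sqrt(ad / bc))), where a and d are the concordant cells.
    """
    if b * c == 0:
        raise ValueError("approximation is undefined when b or c is zero")
    return math.cos(math.pi / (1.0 + math.sqrt((a * d) / (b * c))))


# Hypothetical counts, NOT from the paper: groups split at the median B-G z score.
# rows = delinquent / non-delinquent, columns = above / below the median.
r_example = tetrachoric_cos_pi(34, 9, 9, 34)
print(f"tetrachoric r (cosine-pi) = {r_example:.2f}")
```

The approximation is most trustworthy when both variables are split near their medians, as was done in the study.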


SUMMARY 
The Pascal and Suttell scoring method of the B-G Test has proved valid for 
differentiating between delinquents and non-delinquents. The use of a different 
questionable range than that reported by Pascal and Suttell is suggested for en- 
hancing the dichotomization of groups in diagnostic prediction. 

PEAQ: A MEASURE OF PSYCHOPATHIC BEHAVIOR¹

MORTIMER H. APPLEZWEIG        ANDREW S. DIBNER        RAYMOND L. OSBORNE
Connecticut College           Clark University        New York City


INTRODUCTION 

Preu defines psychopathy as designating ‘“‘. . . those individuals who have mani- 
fested considerable difficulty in social adjustment over a period of many years or 
throughout life, but who are not of defective intelligence nor suffering from structural 
disease of the brain or epilepsy, and whose difficulties in adjustment have not been 
manifested by the behavioral syndromes which are conventionally referred to as 
neuroses and psychoses“: P- %),”” Despite a number of attempts to dismiss the 
category as a wastebasket classification, a useless conglomerate, a misclassification, 
etc.“ ©, the terms psychopathic personality, psychopathic inferior, psychopathic de-
viate, constitutional psychopathic state, sociopathic personality, anethopath, etc., are


¹Presented at the meetings of the Eastern Psychological Association in New York City, April 12,
1957. 
still widely used today for the classification of persons with behavior and/or adjust-
ment problems. 

The present paper is a description of an attempt to psychometrically identify
psychopathic behavior. The first form of the Personal Experience and Attitude
Questionnaire² was developed some 13 years ago, during World War II, at an Army
Air Force Regional Hospital in Colorado. The large number of referrals to an under- 
staffed clinic made the use of a screening battery of tests mandatory. The unique 
problems of military adjustment made the identification of psychopathy particular- 
ly important. Dr. Osborne, then chief of neuropsychiatry, was responsible for the 
initial selection of items for the scale, and Dr. Applezweig for the initial standardiza- 
tion. The Inventory was found useful in clinical work, but before it could be properly 
evaluated, changes in military assignments led to its temporary abandonment, and 
it was not taken up again until 1949, by Drs. Applezweig and Dibner at the Uni- 
versity of Michigan. Here a rewriting of the items and a shortening of the scale was 
undertaken on the basis of an item analysis of the earlier data and new data obtained 
on a large undergraduate male sample.


BRIEF DESCRIPTION OF THE PEAQ
The PEAQ is a true-false questionnaire, consisting of 150 items, 10 of which 
constitute a validity scale and do not contribute to the total score. The 140 items 
composing the psychopathy scale concern life experiences or attitudes in six general 
areas, as follows: 
Criminalism: 25 items (e.g., I go for easy money, whether it’s right or wrong. Two or more
of my friends have been in jail. Most policemen are crooked and will accept graft.)

Emotional instability: 20 items (e.g., My childhood home was not a happy one. People expect
too much of me. I am more temperamental than most people.)

Inadequate personality: 25 items (e.g., I’d rather save my money than spend it on extra com-
forts. I have neither many friends nor any close friends. I don’t have any special goal or am-
bition.)

Sexual psychopathy: 15 items (e.g., There have been times in my life when I masturbated 4 or
5 times a day. I have had sexual intercourse very often. I have known a prostitute quite well.)

Nomadism: 15 items (e.g., If I get tied down to any one place or job for too long, I feel ner-
vous or trapped. I am not the kind of person who likes to travel. I like to change my surroundings
often.)

“Other” psychopathic traits: 40 items (e.g., I have no habits that are considered very bad. I
find it hard to make new friends to take the place of old ones. I have never been up before a judge.)


The sub-scale items are randomly intermixed throughout the test. Although 
separate scores for sub-scales are readily obtainable, the present report will deal only 
with total test scores. Data for sub-scales cannot be analyzed meaningfully until 
separately diagnosed criterion groups are available for their validation. 
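
As a quick consistency check on the item counts given above, the six sub-scales can be tallied against the stated totals. The dictionary keys simply repeat the sub-scale names from the text; the variable names are illustrative.

```python
# Item counts per sub-scale, as listed in the description of the PEAQ.
subscale_items = {
    "Criminalism": 25,
    "Emotional instability": 20,
    "Inadequate personality": 25,
    "Sexual psychopathy": 15,
    "Nomadism": 15,
    "Other psychopathic traits": 40,
}
validity_items = 10  # validity scale; does not contribute to the total score

scored_items = sum(subscale_items.values())
total_items = scored_items + validity_items

print(f"Scored items: {scored_items}")
print(f"Items in questionnaire: {total_items}")
```

The counts agree with the 140 scored items and 150 items overall reported in the text.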


PROCEDURE 


PEAQ scores were obtained from five different subject groups, two of which 
were considered psychopathic and three were normal controls. A total of 209 Ss were 
tested at different times over the past eight years. All testing was done in group 
situations and all Ss were assured that scores would not be available to personnel of 
their own institution or situation. All groups were told that we were obtaining 
samples for development of a psychological questionnaire. The five samples tested 
are briefly described below: 


A. Diagnosed reformatory sample. Fifty-five inmates of the Ionia State Reformatory 
(Michigan) were tested during the Spring of 1949. These Ss had all been examined psychiatrically 
and those who had a primary diagnosis of psychopathic personality (with or without additional 
specification of type) were selected for the study. No other criterion of selection was applied. 
Ss ranged in age between 17 and 28, with a mean age of 21.00 ± 2.83 years. Their average educa-
tion was 8.3 years and average IQ as determined by the prison psychologist was 96.38.


²Personal Experience and Attitude Questionnaire. Copyright 1956 by Mortimer H. Applezweig,
Ph.D., Andrew S. Dibner, Ph.D., and Raymond L. Osborne, M. D. Available for research purposes
from Dr. M. H. Applezweig, Connecticut College, New London, Conn.
B. Diagnosed alcoholic sample. Forty patients in the Blue Hills Clinic of the Connecticut
Commission on Alcoholism (both in- and out- patients) were tested in early 1950. The PEAQ 
was administered as part of a regular testing battery to patients referred to the staff psychologist. 
A diagnosis of primary or secondary alcoholism was used as the criterion for inclusion of an S’s 
score in this sample. Hewitt, Buhler and Lefever“) and Manson“) all justify the inclusion of 
alcoholic Ss in a psychopathic classification. Psychometric data (MMPI) further supports this 
identification“. Ss in this group range in age between 22 and 58, with a mean of 38.15 ± 7.73
years. Educational level ranged from 3 to 17 years of schooling completed, with a mean of 9.5 ±
3.0 years.

A + B. The two samples described above, consisting of 95 Ss, constitute the psychopathic 
group. 

C. Trade School Sample. In the spring of 1950, 32 students in the Wilbur Wright Vocational 
High School, a Detroit (Michigan) trade school, were tested as the first of three control samples. 
The age range of this sample was from 16 to 19 years, with a mean of 17.4 ± .82 years. Their
education ranged from 9th to 11th grades, with a mean of 10.5 ± .05 years of schooling. This
group was chosen because a portion of the reformatory group (sample A) also were brought up in 
Detroit. 

D. Navy enlisted men sample. Twenty-four enlisted submarine candidates at the U. S. 
Navy Submarine Base in New London, Connecticut, were tested as part of a battery of experi- 
mental measures given in 1950 at the Medical Research Laboratory. The Ss in this group ranged 
in age from 17 to 23 years, with a mean of 19.17 ± 1.65 years. The mean level of schooling com-
pleted was 11.8 ± .76 years, with a range from 10 to 14 years.

E. College sophomore sample. A total of 58 college sophomore men were tested at Wesleyan 
University (Connecticut) in a classroom situation in psychology. The Ss were obtained in two 
successive years (1950, 1951), and the two samples combined when no differences were found be-
tween their scores. The College group ranged in age from 19 to 26 years, with a mean of 21.21 ±
1.88 years. All were in their 14th year of schooling.

C + D + E. The three samples, C, D and E, constitute the 114 Ss of the control group. It
should be noted that whereas a clear diagnosis of psychopathy or alcoholism was available in all
cases of Ss included in groups A and B, respectively, no equivalent diagnoses of non-psychopathy
were available for the control Ss. Their presence in the life situations in which they were found
was taken as prima facie evidence of their representativeness of a typical adult male population,
with no further investigation of freedom from psychopathy being made.
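
The sample sizes reported for the five groups can be tallied to confirm the group totals given in the text. The short labels are illustrative; the Ns are those stated in the sample descriptions.

```python
# Sample sizes as stated in the descriptions of samples A through E.
psychopathic_samples = {"A (Reformatory)": 55, "B (Alcoholic)": 40}
control_samples = {"C (Trade School)": 32, "D (Navy enlisted men)": 24, "E (College)": 58}

n_psychopathic = sum(psychopathic_samples.values())
n_control = sum(control_samples.values())
n_total = n_psychopathic + n_control

print(f"Psychopathic group: {n_psychopathic}")
print(f"Control group: {n_control}")
print(f"Total: {n_total}")
```

The sums reproduce the 95 psychopathic Ss, 114 control Ss, and 209 Ss overall.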


RESULTS 
Reliability. Table 1 presents Pearson coefficients for the two halves of the 140-
item questionnaire, split odd-even within sub-tests. The right-hand column shows
the same coefficients after application of the Spearman-Brown Prophecy Formula”.
For all samples, and for samples combined, the measure is found to be highly con-
sistent internally.
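
The Spearman-Brown correction for a split-half coefficient can be sketched as below. The input value is hypothetical, because the coefficients in Table 1 did not survive in this copy; the formula for doubling test length, r_full = 2r / (1 + r), is the standard one.

```python
def spearman_brown(r_half, length_factor=2.0):
    """Estimated reliability of a test lengthened by `length_factor`.

    For a split-half coefficient the factor is 2, giving 2r / (1 + r).
    """
    return (length_factor * r_half) / (1.0 + (length_factor - 1.0) * r_half)


# Hypothetical half-test correlation, NOT a value from the paper.
r_half_example = 0.80
r_full_example = spearman_brown(r_half_example)
print(f"split-half r = {r_half_example:.2f}, corrected r = {r_full_example:.3f}")
```

The corrected coefficient is always at least as large as the half-test correlation when that correlation is positive.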


TABLE 1. MEANS, SIGMAS, RANGES, AND SPLIT-HALF* RELIABILITY COEFFICIENTS FOR PEAQ SCORES
FOR EACH SAMPLE AND FOR PSYCHOPATHIC AND CONTROL SAMPLES COMBINED

Sample                       N     Mean      σ       Range      r     r**

A (Reformatory)              55             20.47    14-106
B (Alcoholic)                40             18.79    16- 99
A + B (Psychopathic)         95             19.80    14-106

C (Trade School)             32             19.42    14-104
D (Navy Enl. Men)            24             20.30    15- 86
E (College)                  58             12.96     7- 75
C + D + E (Control)         114             17.00     7-104

Total Sample                209             21.72     7-106

*Split odd-even within sub-scales.
**Spearman-Brown correction applied.


Comparison of psychopathic and control groups. Before comparing the psycho- 
pathic and non-psychopathic samples, Bartlett’s test for homogeneity of variance “? 
was applied, and the samples were found not to differ significantly in variance. 
Means, sigmas and ranges for total PEAQ scores for all groups are shown in Table 1. 
Table 2 shows results of an analysis of variance for the five samples treated separately
and Table 3 the t-test comparisons of each sample with each other sample. It can be
seen from these tables that the groups differ significantly, with both psychopathic
groups differing from all three control samples and not from each other. The differ-
ences between psychopathic and control samples are all significant at better than the
.001 level of confidence.


TABLE 2. ANALYSIS OF VARIANCE OF PEAQ SCORES FOR FIVE SAMPLES
(N = 209)

Source              Sum of Sqs.      df      Variance Est.

Between groups       29686.02         4        7421.505
Within groups        68871.86       204         337.607
Total                98557.88       208

F = 21.98     P = .01
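
The F ratio in Table 2 can be recomputed from the sums of squares and degrees of freedom reported there. This is only an arithmetic check on the printed figures; the variable names are illustrative.

```python
# Figures as printed in Table 2.
ss_between, df_between = 29686.02, 4
ss_within, df_within = 68871.86, 204

ms_between = ss_between / df_between   # variance estimate between groups
ms_within = ss_within / df_within      # variance estimate within groups
f_ratio = ms_between / ms_within

print(f"MS between = {ms_between:.3f}")
print(f"MS within  = {ms_within:.3f}")
print(f"F = {f_ratio:.2f}")
```

The recomputed values agree with the tabled variance estimates and with F = 21.98.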


TABLE 3. TESTS OF SIGNIFICANCE (t) OF DIFFERENCES BETWEEN GROUPS

Group                    B      C      D      E

A (Reform.)              4.42**  4.36**
B (Alcohol.)             3.94**  3.0"
C (Trade Sch.)           <i
D (Navy enl. men)
E (College)

*significant at the .05 level of confidence.
**significant at the .001 level of confidence.


The only other difference which was significant at all was the one between the 
Trade School and College samples, and this at the .05 level of confidence. It is not 
surprising to find such a difference between these two groups for they are close to 
opposite poles in the make up of a normal group with its wide range of behavior in 
conformity or non-conformity to society’s standards. Examination of the differences 
between the groups indicated that the college group was higher principled and more 
controlled (less impulsive) than the trade school group, had fewer brushes with the 
law, was more considerate of other people’s feelings and less suspicious of others’
motives. They also had fewer nomadic impulses and professed to fewer direct hetero- 
sexual experiences, although a greater percentage admitted to frequent autoerotic 
sexual practices. Despite these differences between the groups they are both con- 
sidered as part of our normal sample. 

Application of a cutting score. Following a procedure developed by Guilford and 
reported by Manson“, we divided our total sample in half, by sub-samples, and de- 
termined a cutting score for psychopathy on half the population. The formula ap- 
plied here was the following: 


$$X_c = M_t + \frac{(.5 - p)\,\sigma_t^2}{(M_p - M_c)(pq)}$$


where $X_c$ = cutting score, $M_t$ = mean of total sample used, $M_p$ = mean of the
psychopathic group, $M_c$ = mean of control group, $p$ and $q$ = the Ns of the two samp-
les, respectively, expressed as proportions of the total, and $\sigma_t$ = standard deviation of total sample.
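
A minimal sketch of the cutting-score computation follows. It reads the printed formula as X_c = M_t + (.5 - p) σ_t² / ((M_p - M_c) p q), which is an assumption given the condition of the print, and takes p as the proportion of the psychopathic group. The group means used are hypothetical, since the means in Table 1 are not legible in this copy.

```python
def cutting_score(mean_total, mean_psychopathic, mean_control, p, sd_total):
    """Cutting score under the assumed reading of the formula in the text.

    p is the proportion of the sample in the psychopathic group; q = 1 - p.
    """
    q = 1.0 - p
    if mean_psychopathic == mean_control:
        raise ValueError("group means must differ")
    adjustment = (0.5 - p) * sd_total**2 / ((mean_psychopathic - mean_control) * p * q)
    return mean_total + adjustment


# Hypothetical inputs, NOT values from the paper.
p_example = 0.45
mean_p, mean_c = 60.0, 40.0
mean_t = p_example * mean_p + (1.0 - p_example) * mean_c
cut = cutting_score(mean_t, mean_p, mean_c, p_example, sd_total=20.0)
print(f"cutting score = {cut:.1f}")
```

When the two groups are of equal size (p = .5) the adjustment vanishes and the cutting score equals the mean of the total sample.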

The cutting score of 47 thus obtained was applied to the second half of the 
sample, resulting in correct identification of 73.2% of the control group and 70.8% 
of the psychopathic group. The overall correct identification was 72.1%. Since the 
number of ‘‘misses’’ was found to be higher than desired, recourse was made to an 
item analysis, and a first approximation of a test revision undertaken. In examining 
the relative percentages of psychopathic and control group responses to each item, 
it was found that 16 of the 140 items actually reversed the expectation that the psy-
chopathic group would score higher. Several other items showed little or no differ-
ence between the two groups. The 16 reversing items were dropped from the total
scores and a new cutting score derived. Applying the new cutting score improved 
control group identification to 80.4% and psychopathic group identification to 
77.1%, with an over-all correct identification of 78.8%. It can certainly be expected 
that further elimination of non-discriminating items will improve the predictive 
efficiency of the measure. As with any use of clinical judgment as a criterion for the 
validation of a psychometric, a portion, at least, of the error remaining may be at- 
tributable to the lack of validity of the criterion. Since the control group may con- 
tain a few psychopathic individuals, the “false-positives’’ here may not all be false. 
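
The reported hit rates can be illustrated with counts that reproduce them. The counts below are an assumption: the paper gives only percentages, and a second-half sample of 56 controls and 48 psychopathic Ss is simply one split consistent with every percentage quoted.

```python
def percent_correct(hits, n):
    """Percentage of cases correctly identified, rounded to one decimal."""
    return round(100.0 * hits / n, 1)


# ASSUMED second-half sample sizes and hit counts (not stated in the paper).
n_control, n_psychopathic = 56, 48
original = {"control": 41, "psychopathic": 34}   # cutting score of 47
revised = {"control": 45, "psychopathic": 37}    # after dropping 16 items

for label, hits in (("original", original), ("revised", revised)):
    overall = percent_correct(sum(hits.values()), n_control + n_psychopathic)
    print(
        label,
        percent_correct(hits["control"], n_control),
        percent_correct(hits["psychopathic"], n_psychopathic),
        overall,
    )
```

Under these assumed counts the overall rate is the pooled proportion of hits, not the simple average of the two group percentages.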


DISCUSSION


Treating the 140-item psychopathy scale as a whole, we have found it to have 
high internal consistency, and to give evidence of validity for the discrimination of 
psychopathic from normal persons. 

As indicated earlier, the PEAQ is composed of a series of items touching on 
varieties of asocial and anti-social behavior, from the more overt sexual and criminal 
acts to more subtle indicators of lack of impulse control, restlessness, lack of ethical 
standards, suspicion of authority, lack of goal persistence, lack of real personal ties, 
and so on. Whether these do indeed form a meaningful syndrome of ‘“psycho- 
pathy” cannot be determined at this time. Use of an inventory such as the PEAQ 
and careful analysis of the data obtained with clinical groups should do much to help 
clarify the nature of any syndrome or syndromes involved. 

Rubinstein and Lorr®, for example, recently reported the successful use of a 
sample of 50 items taken from the PEAQ in the identification of patients who were 
to become early terminators in therapy. The items used by Rubinstein and Lorr 
were drawn largely from our Nomadism, Emotional Instability and Inadequacy 
scales. To a limited degree, their study offers evidence in support of the usefulness 
of the PEAQ in research on prediction problems where the so-called psychopathic 
traits of lack of impulse-control, hostility toward authority, restlessness and nomad- 
ism, and the level of goal-directed persistence toward culturally-approved goals 
seem crucial. 


SUMMARY AND CONCLUSIONS

A 140-item true-false screening questionnaire for psychopathic behavior, the 
Personal Experience and Attitude Questionnaire, is described. Total score on the 
measure is highly reliable and discriminates at a highly significant level between 
diagnosed psychopathic and control samples. The use of a cutting score permits cor- 
rect identification of 77% of the psychopathic group and 80% of a control sample. 
There is good promise of even better prediction with an empirically revised scale. 
The use of sub-scale scores for criminalism, emotional instability, inadequacy, 
sexual psychopathy and nomadism is considered possible only when validating in- 
formation on these sub-scales becomes available. 

PROBLEM


Several reports have appeared recently of attempts to select patients who would 
be continued in treatment by the therapist, and to predict the outcome of treat- 
ment: 2), These studies were made on both outpatient and institutionalized groups 
and appear to have considerable promise for screening ‘drop-outs’. The present 
paper reports an attempt to develop a self-administering inventory for selecting the 
individuals among a drug-addict population who are most acceptable for psycho- 
therapy. Specifically, this paper (a) defines a criterion of acceptability for psycho- 
therapy, (b) describes an inventory designed to predict this criterion, (c) presents the 
original cross-validation of combinations of items from the inventory, (d) indicates 
how the final acceptability scale was constructed, (e) includes normative data based 
on a large number of subjects and presents correlations between the scale and other
variables, and (f) reports validity generalization tests of the scale on a different
sample. 


THE CRITERION 

Since ‘‘acceptability for psychotherapy’’, as defined in a previous study “’, was 
employed as the criterion for the reliability and validity studies to be reported here, 
the original study will be described briefly. Each of three psychotherapists inter- 
viewed approximately one-third of 100 male narcotic addicts who comprised the 
subject group, and during three one-hour interviews “..., judged the acceptability
of each patient for psychotherapy.” These judgments constituted the criterion
which was used for calculating the predictive power of a battery of psychological 
techniques. The psychotherapists found that the more acceptable patients showed 
a greater ability to verbalize feelings, spontaneously produced more problems for 
discussion, gave more evidence of original thinking, were more apt to ask for aid 
with their problems, and showed more evidence of previous ability to modify their 
patterns of living. 


DESCRIPTION OF THE INVENTORY 


The original pool of 180 items from which the inventory was derived was largely 
determined by hypotheses from other investigations of addict behavior“: * ® and 
from several years of clinical association with hospitalized addicts, on the supposi- 
tion that the more “‘normal’’ individuals (in general, the less psychopathic) would be 
considered more acceptable for psychotherapy. Specifically, it was hypothesized 
that the more acceptable individuals would show (a) a greater degree of general 
motivation, i.e., they would produce more positive responses to items having an
“interest” or “satisfaction” content; (b) fewer positive responses to items which reflected
the desirability of the drug condition; (c) less hostility toward authority and
fewer antisocial and asocial attitudes; (d) less evidence of broken homes and ex- 
tremely harsh or pampered rearing; (e) more sympathy and empathy; and (f) more 
interest in obtaining help in overcoming their problems. In order to minimize the 
semantic barrier, items of the inventory were written in language commonly used by 
addict patients“. 


Cross validation on original sample. The inventory was administered in small 
groups to 100 institutionalized addict patients who were subsequently judged 
acceptable or unacceptable for therapy. Upon termination of the study the group 
was divided into random halves to form two subgroups. An item count for each 
sub-group was made to isolate the items which were significantly related to the
criterion. Using items which produced t values for the difference between proportions
that exceeded 1.5, empirical scoring keys were constructed to reflect the direction of 
response (true or false) of the acceptable subjects. The key developed for one sub- 
group was used to predict acceptability for psychotherapy in the other subgroup, 
and vice versa. 
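
The split-half keying procedure just described can be sketched in code. This is only a minimal illustration under stated assumptions: the paper does not give its formula, so a pooled-variance critical ratio for the difference between two proportions is assumed here, and the function names and the counts in `half_a` are hypothetical.

```python
import math


def critical_ratio(k_acc, n_acc, k_rej, n_rej):
    """Critical ratio for the difference between two proportions.

    k_acc / n_acc: number of 'true' answers and group size, acceptable subjects.
    k_rej / n_rej: the same for unacceptable subjects.
    Assumption: pooled standard error (the paper does not state its formula).
    """
    p_acc, p_rej = k_acc / n_acc, k_rej / n_rej
    pooled = (k_acc + k_rej) / (n_acc + n_rej)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_acc + 1 / n_rej))
    return 0.0 if se == 0 else (p_acc - p_rej) / se


def build_key(item_counts, n_acc, n_rej, cutoff=1.5):
    """Keep items whose ratio exceeds the cutoff in absolute value and key
    each one in the direction favored by the acceptable subjects."""
    key = {}
    for item, (k_acc, k_rej) in item_counts.items():
        ratio = critical_ratio(k_acc, n_acc, k_rej, n_rej)
        if abs(ratio) > cutoff:
            key[item] = "T" if ratio > 0 else "F"
    return key


def score(responses, key):
    """Number of keyed items answered in the keyed direction."""
    return sum(1 for item, direction in key.items()
               if responses.get(item) == direction)


# Hypothetical counts of 'true' answers in one random half
# (25 acceptable and 25 unacceptable subjects).
half_a = {"item_1": (20, 10), "item_2": (12, 13), "item_3": (5, 15)}
key_a = build_key(half_a, n_acc=25, n_rej=25)
```

In the design reported above, a key built from one half would then be applied with `score` to the subjects of the other half, and the reverse.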

This procedure produced agreement between the inventory results and the 
judgment of the therapists in 74 percent of all cases. If the inventory had been used 
to refer to therapy those patients scoring in the upper quartile, 78 percent of such 
referrals would have proved acceptable for psychotherapy as indicated by the 
criterion. 


Construction of acceptability scales. Seventy-eight of the 180 items of the in- 
ventory yielded the same direction of response in the two sub-samples and produced 
tetrachoric correlation coefficients with the criterion which were statistically sig- 
nificant at the .05 level of confidence. Two senior psychiatrists of the hospital staff, 
neither of whom had participated in the initial study, scored the 180 items to reflect 


TABLE 1. ITEMS OF ACCEPTABILITY SCALE C, SHOWING DIRECTION OF RESPONSE OF ACCEPTABLE
SUBJECTS








I usually have a good appetite. (T) 
Yd well-adjusted people become addicted. 
) 


The only thing wrong with addiction is that 
there is a law against it. (F) 

I feel most like quitting drugs for good when 
kicking a habit. (F) 

I have never been in love with anyone. (F) 
Marriage has only (or would only have) inter- 
fered with my activities. (F) 

There should be more community clinics to 
treat addicts after they leave the hospital. (T) 
There is nothing wrong with me (except that 
I use drugs). (F) 

Many of my dreams are about sex. (T) 


As long as I can have drugs I am not interested 
in help to overcome my habit. (F) 


I think most people tell lies to get ahead. (F) 

I am a good mixer. (T) 

i I did not have to associate with addicts. 
(T) 


I had strong affection for my mother. (T) 


There is nothing I could not do if I had drugs 
all the time. (F) 


I know I am different from the normal man but 
I don’t know what the difference is. (T) 


I finished high school. (T) 
All addicts are pretty-much alike. (F) 


I am in favor of making drug use legal for real 
addicts. (F) 


The main reason for coming to this hospital was 
that my habit was getting too big and too ex- 
pensive. (F) 


I had psychiatric help on the outside. (T) 


I am likely not to speak to people until they 
speak to me. (F) 


I like to sit around and enjoy a shot. (F) 
I will never be satisfied with a routine job. (F) 


I liked school. (T) 

I think a psychiatrist could help me. (T) 

Most people are honest because they are afraid 
of being caught. (F) 

Most people make friends because friends are 
likely to be useful to them. (F) 

Our society should be changed so that an addict 
can get what is due him. (F) 

I wanted to learn a profession but something 
always came up to stop me. (F) 

im had some education beyond high school. 


Drug addiction can be inherited. (F) 

Any addict with will power should be able to 
give up drugs on his own. (F) 

I wish I were not so shy. (T) 

My parents were living together when I left 
home (or, are still living together, if you still 
live with them). (T) 

I played hooky from school quite often as a
youngster. (F)

The problem of drug addicts would be solved if 
doctors could — a drug which relieves 
pain, but is not habit-forming. (F) 

If a doctor were called to see me when I was 
seriously ill (and did not have a habit), I would 
tell him that I had been addicted to drugs and 
for him to be careful with what he gave me. (T) 
A drug addict is more respectable than an al- 
coholic. (F) 

An addict never does abnormal sex acts when on 
junk. (F) 

Sometimes I take myself too seriously. (T) 
admit, I wish I were a “Square John’’. 


I rarely have intercourse when I am not taking 
drugs. (F) 

There are very few things that interest me. (F) 
If I have been off drugs for awhile, I feel a little 
sick just before taking the first shot. (T) 
the direction of response which they judged would characterize acceptable
therapeutic candidates. It was found that of the 78 items that empirically differentiated
acceptable and non-acceptable subjects in the original cross-validation, 46 were 
agreed upon by the psychiatrists. These items (scale C), which met both empirical 
and rational requirements, are presented in Table 1 together with the direction of 
response of the acceptable patients. For calculating indices of reliability for this scale, 
the 46 items were used to construct two 23-item scales (Scales A and B) by assigning 
items to each that were paired for both content and validity coefficient. 


Normative data and correlations with other variables. The inventory was ad- 
ministered to 400 institutionalized narcotic addicts who remained in the hospital a 
sufficient period to become screening problems. The distribution of raw scores on 
Scale C is presented in Figure 1. As can be seen, these scores approximate a normal
FIGURE 1. FREQUENCY DISTRIBUTION OF RAW SCORES ON ACCEPTABILITY
SCALE C OBTAINED BY 400 INSTITUTIONALIZED NARCOTIC ADDICTS


distribution, with the fiftieth percentile falling at 26, the eighty-fourth at about 31, 
and the sixteenth at 21. The scaling methods described by Johnson“ were then 
used to transform the raw data to T-scores giving a normal distribution with a mean 
of 50 and a standard deviation of 10. Raw scores and equivalent T-scores are pre- 
sented in Table 3. 
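
A T-score conversion of this kind can be sketched as follows. Johnson's scaling methods are not described in the text, so this is an assumption-laden illustration rather than the authors' procedure: `normalized_t` assumes the usual area transformation (percentile rank to normal deviate), and `linear_t` assumes a mean of 26 and a standard deviation of about 5, values read off the percentiles quoted above.

```python
from statistics import NormalDist


def normalized_t(percentile_rank):
    """T-score for a percentile rank (0 < rank < 1) by area transformation.

    Assumption: a standard normalizing transformation; the paper cites
    Johnson's methods without describing them.
    """
    z = NormalDist().inv_cdf(percentile_rank)
    return 50 + 10 * z


def linear_t(raw, mean=26.0, sd=5.0):
    """Linear T-score; the mean and SD are estimated from the reported
    16th, 50th and 84th percentiles (21, 26 and about 31)."""
    return 50 + 10 * (raw - mean) / sd


t_at_median = normalized_t(0.50)   # raw score 26
t_at_84th = normalized_t(0.84)     # raw score of about 31
```

Because the raw scores are close to normally distributed, the two conversions nearly coincide: a raw score of 31 corresponds to a T-score of about 60 under either one.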

A product-moment coefficient of reliability of .67 (N = 400) was obtained for 
scale C by correlating Scale A with Scale B and converting to full length by using the 
Spearman-Brown prophecy formula. Reliability of the full-length scale is somewhat 
greater in sub-samples of more highly motivated subjects. In 101 patients, who had 
requested psychotherapy, a product-moment coefficient of .75 was computed. 
Furthermore, reliability of the 46-item scale is not affected by removing “buffer’’ 
questions from the inventory. A brief questionnaire comprised of the 46 keyed items, 
plus 14 others designed to measure test-taking attitude, was administered to a 
second sample of 101 patients who had requested psychotherapy, and a product- 
moment correlation coefficient of .73 was obtained. 
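
The Spearman-Brown step can be illustrated briefly. The correlation between Scales A and B is not reported, so the half-scale value below is back-calculated from the stated full-length coefficient of .67 and should be read as an inference, not a reported figure; the function names are illustrative.

```python
def spearman_brown(r_half, factor=2.0):
    """Reliability of a test lengthened by `factor`, given the correlation
    between parallel parts (factor = 2 for two half-scales)."""
    return factor * r_half / (1 + (factor - 1) * r_half)


def half_from_full(r_full):
    """Invert the two-half Spearman-Brown formula."""
    return r_full / (2 - r_full)


# Implied correlation between the 23-item Scales A and B if the
# stepped-up reliability of Scale C is .67.
r_ab = half_from_full(0.67)
```

On this reading the two half-scales correlate at roughly .50, which the formula steps up to the reported .67 for the 46-item scale.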

All patients in this study were male. The applicability of the scale to female 
addicts has not been tested. The differences between the means of White and Negro 
subjects were statistically negligible. However, there was a statistically significant 
difference between the standard deviations of these racial groups, the Negroes showing
less variability. A tetrachoric coefficient of correlation between chronological
age and acceptability test score in 200 patients was +.30; this was statistically significant
beyond the .01 level of confidence. In the same sample, however, the coefficient
of correlation between age and selection-rejection for therapy by means of
psychiatric staffing was nonsignificant (-.09).


TABLE 3. ACCEPTABILITY TEST SCORES OBTAINED BY THREE GROUPS OF INSTITUTIONALIZED
NARCOTIC ADDICTS. (T-SCORES BASED ON RESPONSES OF N = 400.)

T-Scores | Raw Scores | In Individual Therapy (3 or more interviews) | In Group Therapy (3 or more interviews) | Not Selected for Therapy or “dropped out”

A statistically significant relationship was found between the acceptability scale 
and certain test variables of the MMPI and the Cattell Sixteen Personality Factors 
Test °°) as shown in Table 2. The significant tetrachoric correlations found between 
the scale and the more “healthy” variables, such as Responsibility, Mf, and Cattell’s 
Factors A and N, tend to support the writers’ hypothesis that the less psychopathic 
and more normal subjects would score higher on acceptability for psychotherapy. 
Even the significant correlations found between the scale and Neuroticism add sup- 
port to this position if it is assumed that, in general, the neurotic and anxious in- 
dividuals in the addict population profit more from therapy than the individuals 
labelled psychopathic character disorders. 


Validity generalization. Between October 1955 and April 1956 the psychiatric 
staff of the hospital selected 94 patients for psychotherapy independently of scores 
on the acceptability scales. Eight of these patients were dropped from therapy within
one or two interviews, 43 of the remaining 86 patients were assigned to individual 
psychotherapy, and 43 were assigned to group psychotherapy. These subjects constituted
the experimental groups in the validity generalization study. The inventory
was administered routinely to these subjects, and was administered concurrently to
42 other patients who were admitted to the hospital during the same period of time
and who received psychiatric staffing but were not selected for psychotherapy. These
42 patients, plus the 8 who were dropped from therapy within one or two interviews,
provided the control group for therapeutic selectees.


TABLE 2. CORRELATIONS OF THE ACCEPTABILITY SCALE WITH OTHER TESTS

MMPI Scales (N = 93)          Cattell 16 P. F. Test Factors (N
Test                          Factor

Responsibility                Warm, sociable
Mf                            Sophisticated, polished
Neuroticism                   Sensitive, effeminate
Dominance                     Bright, intelligent
Anxiety Index                 Experimenting, critical
Psychopathic deviate          Conscientious, persistent
Hypomania                     Eccentric, unconcerned
Hypochondriasis               Tense, excitable

[Correlation coefficients illegible in the source.]

*Significant at the .05 level of confidence
**Significant at the .01 level of confidence

The predictive utility of Scale C can be calculated from Table 3. Although the 
mean T-score for the group which was chosen for individual therapy was slightly 
higher than that of the group chosen for group therapy, the difference was not statistically
significant. The mean scores for the therapy groups, however, were significantly
higher than that of the controls, the differences between means being statistically
significant at the .001 and .01 levels, respectively.

If it is assumed that the ratio of acceptable to unacceptable therapeutic candidates
is 43 to 50 (.46),¹ the data in Table 3 provide an estimate of the relative efficiency
of the scale in selecting patients for individual and group therapy. A cutting
point of 56 “picks up’’ 58 percent of the individual therapy assignees, and 40 percent 
of the group therapy assignees, while picking up only 12 percent of the controls. 
Seventy-seven percent of a sample of patients selected in this way would be accept- 
able for either individual or group therapy in contrast to the acceptability base-rate 
of .46 in the addict population. If a cutting point of 60 were used, 85 percent of the 
test-selected sample would be acceptable for psychotherapy. 
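
The 77 percent figure can be reproduced from the percentages quoted in this paragraph. The sketch below follows the rule stated in the authors' footnote, which counts the control group twice (100 rather than 50) when the two therapy groups are combined; the function name and argument layout are illustrative only.

```python
def selection_efficiency(hit_rates, group_sizes, control_rate, control_size):
    """Proportion of test-selected patients who belong to a therapy group.

    hit_rates / group_sizes: fraction above the cutting point and size of
    each therapy group; control_rate / control_size: the same for controls.
    """
    accepted = sum(rate * n for rate, n in zip(hit_rates, group_sizes))
    rejected = control_rate * control_size
    return accepted / (accepted + rejected)


# Base rate implied by 43 acceptable to 50 unacceptable candidates.
base_rate = 43 / (43 + 50)

# Cutting point of T = 56: 58% of 43 individual-therapy patients,
# 40% of 43 group-therapy patients, 12% of controls (control group
# doubled to 100 because the two therapy groups are combined).
eff_56 = selection_efficiency([0.58, 0.40], [43, 43], 0.12, 100)
```

With these inputs the selected sample is about 78 percent acceptable, against a base rate of about .46, in line with the figures given in the text.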

The present inventory was designed specifically for use with narcotic addicts. 
However, a preliminary investigation which used parallel forms of this inventory 
with addicts and alcoholics has indicated that it may be applicable to the study of 
the alcoholic. Differences were found between the two groups in responses to items 
designed to measure sensitivity to pain, kind and degree of hostility and aggression, 
as well as between items concerning preferred drug effects. 


SUMMARY 


A questionnaire of 46 items was constructed out of a pool of 180 items for select- 
ing individuals among a drug-addict population who were most acceptable for psy- 
chotherapy. The criterion of acceptability was the judgment of psychiatrists made 
during three one-hour interviews with each patient. The first study utilized 100 
institutionalized narcotic addicts after withdrawal treatment had been completed. 

An item count was made to find the items which were significantly related to the 
criterion. Cross validation demonstrated that combinations of items from the 


1In a recent study“) psychiatrists estimated that 46 per cent of the addict patients that were 
interviewed were acceptable for psychotherapy. Thus, each therapy group (Table 3) should be combined
with the control separately to calculate per cent efficiency above a particular cutting point. If
therapy groups are combined, as was done above, the control group should be doubled, since it is 
assumed that 50 unacceptable candidates were rejected in the process of selecting each of the therapy 
groups. 
questionnaire were 74 percent efficient in predicting psychiatric judgments of acceptability
for psychotherapy. For purposes of further refinement, 78 items that
differentiated between the more acceptable and the less acceptable patients in the 
initial study were reduced to 46 by selecting only those empirically valid items which 
two senior psychiatrists, neither of whom had participated in the original cross- 
validation, independently judged to reflect the responses of acceptable therapeutic 
candidates. 

Normative data for the inventory on 400 subjects were obtained. Validity 
generalization tests of the predictive efficiency of the 46 item scale were then made 
on a group of 136 subjects, some of whom had been independently chosen to receive 
psychotherapy, and some of whom had not been so chosen. When the scale was used 
with this sample to label those patients scoring more than one standard deviation 
above the population mean as “acceptable for therapy,” most were those for whom
psychotherapy had been prescribed. 
A NEW SORTING TECHNIQUE FOR DIAGNOSING
BRAIN DAMAGE¹


JOHN ALTROCCHI AND B. G. ROSENBERG 
University of California² Bowling Green State University


PROBLEM 

Since the pioneering work of Goldstein“, evaluation of abstract behavior by 
means of sorting ability has been of central importance in the psychodiagnosis of 
organic brain damage®. There is, however, a scarcity of quick, easy, quantitative 
sorting tests available “* ™. This report concerns an investigation of a new sorting 
technique which may help meet this need. 

The technique was developed by Yacorzynski“": '*), There are 16 blocks which 
the subject is asked to sort into four groups of four blocks each.³ The unique feature


¹Statistical analysis was facilitated by a grant from the Scholarly Advancement Committee,
Bowling Green State University.

Most of this investigation was carried out at the V. A. Hospital, Palo Alto, Calif., and the V. A.
Hospital, Oakland, Calif. We gratefully acknowledge the considerable advisory help of Drs. Albert
Kostlan, Paul McReynolds, William Grove, and Richard Sears.

²Now at Duke University.

³Dittoed copies of instructions and scoring criteria developed by the authors can be obtained by
writing the senior author, Department of Psychiatry, Duke University.
of the test is that the blocks can be sorted eight different ways: (1) shape of block,
(2) shape of figure (inscribed on the top of the block), (3) color of block, (4) color of 
figure, (5) height, (6) size or volume, (7) area of the top of the block, and (8) area of 
the figure. The subject is asked to keep sorting the blocks in new ways until he gives 
up, spends ten minutes without achieving a sorting, or repeats a category five times. 
Yacorzynski suggested that non-organics over ten years of age can achieve at least 
four sortings and that many organics cannot achieve four or even three. 

The authors undertook extensive pretesting of the technique. It was found to 
be interesting to patients and administrable in an average of twenty minutes. A 
detailed but objective scoring system was developed to give partial credit for partial 
solutions and to take account of complexity of verbalization. Scores range from zero 
to eight in one-third-unit steps (i.e., three-plus, four-minus, four, four-plus, etc.).
Administration of the technique to 18 hospitalized organic patients, 14 hospitalized 
psychiatric patients, 13 aphasic outpatients, and 21 normals suggested that the tech- 
nique was reliably able to separate organics from others. Therefore, a more rigorous 
investigation was undertaken. 


FIRST VALIDITY STUDY


The first study was designed to discover if the test could differentiate organics 
from schizophrenics“. This study was conducted in a large V. A. neuropsychiatric 
hospital. In view of evidence from recent investigations: *: 4: 5 18) that “process” or 
long-term, chronic schizophrenics may have organic damage, as differentiated from 
“reactive’’ or acute schizophrenics, three groups of subjects were needed. A group 
of reactive schizophrenics was unfortunately not available. The organic group was 
composed of 19 hospitalized male patients with a primary diagnosis of syphilis, ter- 
tiary, meningo-encephalitis (paretics). The ward physician individually selected 
19 schizophrenic patients (all presumably of the process type) to match the organic 
patients in age, length of illness, and severity of malfunctioning. 

Since it seemed necessary to discover the effect of intelligence on the Yacorzyn- 
ski Block Technique (YBT), three Wechsler-Bellevue (WB) subtests were selected 
which seemed to give broadest coverage, maximum reliability“, and to be minimal- 
ly affected by brain damage“? ): Information, Comprehension, and Picture Com- 
pletion. A part-whole correlation was run between the total weighted scores on the 
three subtests (hereafter referred to as WS) and WB total scores for a sample of 74 
mixed psychiatric patients.⁴ The resulting correlation of .89, albeit spurious, was
sufficient to suggest that these three subtests estimate WB IQ for this kind of popu- 
lation with satisfactory accuracy. 

The subjects were administered the three WB subtests and the YBT “blind”, 
so that the administrator of the YBT did not know whether the subject was organic 
or schizophrenic. 


Results. The results of the matching and testing are shown in Table 1. Age was 
successfully matched. Length of illness was significantly different for the two groups, 
but the difference between 20 and 27 years of illness suggests that this was not a 
differentiating factor. Though the schizophrenic group did slightly better on the 
YBT, the difference is not significant, most patients in both groups performing 


TABLE 1. RESULTS OF THE FIRST STUDY

                                               Length of            Concepts
Group                    Mean Age   Mean WS†   Illness (years)    X̄        S.D.

Organic (N = 19)           59.1       18.3        19.68          1.789    1.472
Schizophrenic (N = 19)     60.0       21.4        27.26*         2.579    1.350

*Significant at the .01 level of confidence. †Total weighted scores of three subtests.


⁴We are grateful to Dr. W. T. Smelser for making these data available to us.
within the suggested organic range. Thus, if these elderly, deteriorated schizophrenics
were indeed non-organic, the technique failed in its purpose. If, however, as
seems more likely“: *: *: 5 18) they should be viewed as organic, the technique per- 
formed adequately so far. It needed to be demonstrated that non-organics score 
higher on the YBT. 


SECOND VALIDITY STUDY


The Chief Neurologist of a large V. A. General Medical and Surgical Hospital⁵
selected from the neurology ward forty patients who were either demonstrably
organic (central nervous system pathology rostral to the foramen magnum) or
non-organic (no such pathology).⁶ Each patient had had a complete neurological work-up,
including neurological examination and routine laboratory procedures and, when
necessary, EEG, lumbar punctures, and skull X-ray. These patients were seen 
“‘blind”’ by the authors and given the three WB subtests and the YBT. The authors 
did not know the proportion of organic versus non-organic patients until the com- 
pletion of the study. This method is similar to that of Fisher, Gonda, and Little, 
the research situation being as representative of the clinical situation as possible. 
The design follows the suggestion of Piotrowski“® that the best control group for a 
group of organics is a group of patients who were suspected of organic damage but 
later found to be non-organic. 


Results. The criterion yielded a group of 28 organics and 12 non-organics. Significant
differences were not found in age (organic: X̄ = 40.39 years, S.D. = 13.50; non-organic:
X̄ = 40.16 years, S.D. = 11.61) or intelligence estimate (organic: X̄ws = 31.04,
S.D. = 5.98; non-organic: X̄ws = 32.50, S.D. = 5.73). Table 2 presents the
agreement of the YBT and the criterion diagnosis. In view of lack of empirical 
norms, three sets of predictions were made from the data before the criterion was 
known: (a) Difference analysis, where patients were diagnosed organic if their YBT 
score was lower than expected from their intelligence estimate; (b) Difference plus 
qualitative, using the difference method plus any qualitative signs that had been 
noted during the administration of the YBT; (c) Cut-off four-minus, a simple mech- 
anical method ignoring intelligence estimate and diagnosing as organic anyone scor- 
ing below four-minus. After the criterion was known, a fourth method was tried: 


TABLE 2. THE RELATIONSHIP BETWEEN FOUR INTERPRETIVE SYSTEMS FOR THE YBT AND THE
CRITERION DIAGNOSIS FOR THE SECOND STUDY

(Rows give the YBT interpretation; O = organic, NO = non-organic.)

                        Criterion                                 Criterion
                      NO     O   Sum                            NO     O   Sum
Difference     O       1    15    16     Difference +    O       1    14    15
Analysis       NO     11    13    24     Qualitative     NO     11    14    25
               Sum    12    28    40                     Sum    12    28    40

φ = .408
P = .01

                        Criterion                                 Criterion
                      NO     O   Sum                            NO     O   Sum
Cut-off        O       2    15    17     Cut-off         O       3    20    23
Four-Minus     NO     10    13    23     Four            NO      9     8    17
               Sum    12    28    40                     Sum    12    28    40
                      φ = .342                                  φ = .430
                      P < .05 > .02                             P < .01








⁵We are extremely grateful to Dr. J. K. Smith of Oakland V. A. Hospital for his careful handling
of this crucial role in the study.

⁶We realize that this is an arbitrary dividing line, but current neurological knowledge and research
precedent seem to make it a reasonable one.
(d) Cut-off four, whereby any score less than four was considered organic. As can be
seen from Table 2, all four systems correlated significantly with the criterion. Inter- 
preting Table 2, all four systems agree highly with the criterion when they identify 
a patient as organic (94%, 93%, 88%, and 87%). When the YBT fails to identify 
a patient as organic the results are inconclusive (46%, 44%, 43%, and 53%). 
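
The agreement percentages and phi coefficients can be checked from the fourfold tables. The cell counts used below are a reconstruction: some are legible in Table 2 and the rest follow from its marginal totals (28 organic, 12 non-organic) and the percentages quoted above, so they should be treated as assumptions. The function names are illustrative.

```python
import math


def phi(a, b, c, d):
    """Phi coefficient for a fourfold table.

    a: YBT organic, criterion organic      b: YBT organic, criterion non-organic
    c: YBT non-organic, criterion organic  d: YBT non-organic, criterion non-organic
    """
    return (a * d - b * c) / math.sqrt((a + b) * (c + d) * (a + c) * (b + d))


def agreement(a, b, c, d):
    """Proportion of correct calls among patients labeled organic and
    among patients labeled non-organic by the YBT."""
    return a / (a + b), d / (c + d)


# Reconstructed cells (a, b, c, d) for three of the interpretive systems.
difference_analysis = (15, 1, 13, 11)
cutoff_four_minus = (15, 2, 13, 10)
cutoff_four = (20, 3, 8, 9)

phi_four_minus = phi(*cutoff_four_minus)
phi_four = phi(*cutoff_four)
organic_hit, non_organic_hit = agreement(*difference_analysis)
```

Under these counts the two cut-off systems give phi values of about .342 and .430, matching the legible entries in Table 2, and difference analysis gives agreement of about 94 percent for organic calls and 46 percent for non-organic calls.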

The data were also analyzed for information on the influence of age and general 
intelligence on YBT score. For the second study the correlation between YBT score 
and age was -.199 (N = 40, P > .10), so that age does not appear to be of critical
importance. The correlation between YBT score and WS estimate of intelligence 
was .69 for the first study, where the range of both scores was restricted, and .56 for 
the second study where the range was more representative. It is worthy of note that 
despite the significant (P < .01) relationship, WS alone failed to distinguish between
the organic and non-organic groups in the second study, and diagnosis on the
basis of difference between YBT and WS (difference analysis) was not significantly 
more accurate than diagnosis from the YBT alone. 

One further type of analysis was undertaken. Table 3 shows that non-organics 
take less time to achieve their first and second concepts. It may therefore be possible, 
as more normative data is collected, to increase the diagnostic accuracy of the test by 
weighting the scoring for time used. 


TABLE 3. TIME (IN SECONDS) REQUIRED TO ACHIEVE CONCEPTS IN THE
SECOND STUDY

YBT Concepts     Organic     Non-organic     t     P

203.33 oy .58 2.055 < .05
258.16 4.981 < .01
9 7174
-2450
As a step toward the development of norms, the distribution of YBT scores for
all subjects in both studies is presented in Table 4. 


TABLE 4. FREQUENCY OF SCORES ACHIEVED BY ALL SUBJECTS ON THE YBT

YBT Score
Groups 0+ 1- 1+ 2 2 2+3 3 3+ 4 4 44+ 5& 55

Paretic (N = 19)   3 3 2 3 2 2
“Process” Schizophrenic (N = 19)
Organic
Non-organic (N = 12)








DISCUSSION
The YBT is a relatively short, simple, interesting, and unthreatening method 
for evaluating sorting ability. The range of solutions allows for “success” for
low-level subjects and challenge for even the brightest subjects. The scoring is objective,
quantitative, and requires little training. The technique also allows for observation
of many qualitative cues, such as repetition of sortings, for evaluating a patient’s 
handling of a concept formation task. The YBT, even without control for intelli- 
gence, discriminated many organics from other patients while misclassifying very 
few non-organics as organic. A test with this characteristic may be included in a 
battery of tests for organicity without the danger of many false diagnoses. The use 
of the conservative cut-off of four-minus is recommended for clinical situations. This 
tentative demonstration of the YBT’s promise, however, calls for extensive cross- 
validation before it can be established as a valuable diagnostic tool. 


SUMMARY 

The Yacorzynski Block Technique allows a subject to use eight categories of 
shape, color, size, and area in sorting 16 blocks into four groups of four. The test is 
objectively scored by the number of sortings achieved. Tentative norms and scoring 
criteria were developed in pretesting. The YBT and three WB subtests were admin- 
istered to four groups of hospitalized male patients. The technique did not discrimin- 
ate between 19 deteriorated ‘‘process’’ schizophrenics (sometimes considered or- 
ganic) and 19 deteriorated paretics matched for age and severity of malfunctioning. 
It significantly discriminated, in a manner useful for clinical diagnosis, 28 organic 
from 12 non-organic patients on a neurology ward. Time to achieve the first two 
sortings was significantly different for the last two groups. The test thus tentatively 
appears useful in the detection of organicity. 
AN EVALUATION SCALE FOR SHOCK THERAPY
MARVIN J. FELDMAN 
University of Buffalo 


The present study describes the empirical derivation of an objective test de- 
signed specifically to evaluate the results of shock treatment. It may, however, be 
useful in assessing any somatic therapy with psychotic patients. Further it may 
enhance our knowledge of the verbal changes which accompany successful treatment. 


DERIVATION OF THE EVALUATION SCALE 


The study was part of a larger project to construct empirically both prognosis 
and evaluation scales for shock treatment. The Minnesota Multiphasic Personality 
Inventory (MMPI), given before and after treatment, furnished the initial pool of 
items for constructing the two scales. The prognostic scale (Ps) (2) has been described
in detail elsewhere. The evaluation scale complements the work on prognosis.
The same groups were used in constructing both scales except that some patients 
were not tested after shock and so had to be eliminated as subjects for the evaluation
scale. Certain procedures for selecting cases are common to both scales. Since the
article on prognosis describes the selection and personal characteristics of the groups
in detail, only the general outline of these matters will be repeated in the present
study.

The general plan for the evaluation scale was to select criterion groups of re- 
covered and unimproved patients. An item analysis was performed on the pre- and 
post-shock MMPI records of these groups. The item analysis was somewhat un- 
usual in requiring the following three steps: 


1. An initial pool of items was selected which differed significantly between the
pre- and post-shock records of the recovered group. From this pool, items were
eliminated for either of the following two reasons.

2. There was also a significant change in the same direction between the pre- and
post-shock records of the unimproved group.

3. There were no significant differences between the item counts in the post-shock
recovered group and the post-shock unimproved group.


This procedure should produce items reflecting changes in verbal behavior accom- 
panying improvement. 

The recovered and unimproved criterion groups had been classified on the basis 
of both psychiatric ratings at the conclusion of therapy, just prior to discharge from 
the hospital, and one year follow-up ratings. In selecting cases, a restriction was im- 
posed to the effect that the pre-shock MMPI record had to deviate sufficiently from 
normality. The groups were fairly typical of hospital patients used in studies of this 
sort. They were slightly above average in intelligence; the average age was in the 
middle thirties; there was a slight preponderance of men over women, and the 
course of treatment was similar in both groups. The greatest discrepancy was in 
terms of diagnosis; the recovered group contained a preponderance of affective dis-
orders, the unimproved group a preponderance of schizophrenics.¹

The two criterion groups each contained thirty cases. From the pre- and post- 
shock MMPI records, item counts were tabulated separately for each group. After 
this tabulation, the first step was to cull items which changed by 23.3% (approxi- 
mately the five percent level) in the pre- to the post-shock records of the recovered 
group. Of the total 550 items, 131 changed in this manner. Eight of the 131 items 
were eliminated because they also changed significantly from pre- to post-shock in
the unimproved group. Another 55 items were eliminated because they did not
differ significantly from the item counts in the post-shock records of the unimproved
group. The five percent level was used for each of the latter two steps. The remain-
ing 68 items comprised the first edition of the evaluation scale. The scale was scored
so that a high score accompanied improvement.


¹The reader is referred to the monograph on prognosis (2) for an extended description of the
selection and composition of the criterion groups.


VALIDATION OF THE EVALUATION SCALE 


The first step in establishing validity was to apply the scale back to the criterion 
groups. Two conditions must hold if the test is to be discriminating: (a) A signi- 
ficant difference between pre- and post-shock scores of the recovered group, and (b) a 
significant difference between the post-shock scores of the recovered and unimproved 
groups. Table 1 shows that both requirements are fulfilled satisfactorily in the 
criterion groups, for the evaluation scale separates the groups with a high degree of 
success. Table 1 also indicates that the pre-shock scores of the recovered and un-
improved groups were quite similar and that scores of the unimproved group do not change
on the post-shock test. These latter findings would seem to reflect desirable properties
of an evaluation scale.


TABLE 1. APPLICATION OF THE EVALUATION SCALE TO THE CRITERION GROUPS


Criterion Groups               Mean      SD     diff.     CR       P

30 post-shock recovered        55.2      9.0
                                                 24.6    7.29    .0001
30 post-shock unimproved       30.6     14.7

30 pre-shock recovered         29.8     12.4
30 post-shock recovered        55.2      9.0

30 pre-shock unimproved        24.0     11.0
30 post-shock unimproved       30.6     14.7

30 pre-shock recovered         29.8     12.4
30 pre-shock unimproved        24.0     11.0
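
A critical ratio of the kind reported in Table 1 compares two group means. As a minimal sketch, assuming the conventional form (difference between the means divided by the standard error of that difference, with unpooled variances for independent groups), the calculation might look as follows. The function name and the choice of standard-error formula are assumptions; the article does not state which formula was used, so the value computed here need not agree with the printed CR.

```python
import math


def critical_ratio(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Difference between two means divided by its standard error.

    Assumes independent groups and an unpooled standard error. This is an
    illustrative convention, not necessarily the one used in the article.
    """
    se_diff = math.sqrt(sd_a ** 2 / n_a + sd_b ** 2 / n_b)
    return (mean_a - mean_b) / se_diff


# Post-shock recovered vs. post-shock unimproved, means and SDs from Table 1.
cr = critical_ratio(55.2, 9.0, 30, 30.6, 14.7, 30)
print(round(cr, 2))
```

The sketch applies only to comparisons between different groups of patients. The pre- versus post-shock comparisons within one group involve the same patients twice and would call for a paired formula, which needs the individual scores.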





The next step was to cross validate the scale on new cases. Of the original one 
hundred cross validation cases used in the prognosis study, only sixty-three had post- 
shock MMPI’s. These cases were similar to the criterion cases except that follow-up 
periods as short as six months were used in conjunction with the psychiatric ratings 
at the conclusion of treatment to assign patients their recovery status. The cross 
validation cases also included an intermediate group of cases rated improved. The 
comparisons of the results of the post-shock scores of the cross validation cases are 
shown in Table 2. 

The evaluation scale has some degree of success in separating the recovered
group from each of the other two groups, but this unbiased estimate of validity produces
considerably lower critical ratios than the results from the criterion groups.
Scores of the improved group are higher than those of the unimproved group, but not
significantly so. This latter finding is consistent with results obtained from the prognosis
scale. The improved group may be closer to the unimproved than the recovered group, 
clinically as well as psychometrically. That is, the improved group may be labeled 
such because they make a satisfactory social adjustment. Nevertheless they may 
still retain a deep layer of psychopathology so that upon closer analysis they may not 
differ too much from the unimproved group. An evaluation scale would be more use- 
ful, however, to the degree that it can make finer discriminations among groups. The 
value of the present scale is impaired by its inability to separate scores of improved 
and unimproved groups. 

TABLE 2. APPLICATION OF EVALUATION TO THE POST-SHOCK MMPI RECORDS
OF THE CROSS VALIDATION GROUPS


Test Groups          Mean      SD     diff.     CR       P

26 recovered         53.9      6.9
                                       19.2    4.47    .0001
11 unimproved        34.7     12.6

26 recovered         53.9      6.9
                                       13.2
26 improved          40.7     13.4

26 improved          40.7     13.4
                                        6.0    1.28     .20
11 unimproved        34.7     12.6





An inspection of the data reveals that the recovered group is separated most
accurately, with little variability. The improved cases vary widely on the scale, which
may reflect the clinical heterogeneity of this group. A sizable minority of unimproved
cases have favorable scores, thus impairing the value of the scale. Perhaps these
patients register transient gains in mental health which result in high scores.


FINAL ITEM ANALYSIS


Previous work suggests that gains in validity are potentially marked as cri-
terion groups are increased from 20 to 40 cases. Hence a final form of the scale was
constructed with expanded criterion groups. Ten former cross validation cases were
used to increase the recovered and unimproved groups to 40 cases in each group. The
item analysis produced 62 items of which 44 overlapped with items on the prelimin-
ary scale. The final scale was labeled "EVS" to signify its purpose, "evaluation of
shock therapy." All items on both forms are answered in the "plus" direction by the
unimproved groups and of course in the opposite direction by the recovered group. A
"plus" answer is by definition the statistical minority response of the original norma-
tive sample for the MMPI. The 62 items are listed in Table 3 according to their
numbers as given in the back of the revised manual of directions, 1943.


TABLE 3. THE 62 MMPI ITEMS COMPRISING THE "EVS" SCALE. T OR F
INDICATES THE DIRECTION OF RESPONSE BY PATIENTS WITH UNFAVORABLE
PSYCHIATRIC STATUS AFTER THERAPY


A-8 (F)      F-10 (T)      F-55 (T)      I-23 (T)
             F-11 (F)                    I-27 (T)
             F-15 (F)                    I-30 (T)
             F-17 (T)                    I-31 (T)
             F-35 (T)                    I-32 (T)
             F-36 (T)                    I-34 (T)
             F-37 (F)                    I-35 (T)
             F-39 (F)                    I-37 (T)
             F-41 (T)                    I-39 (T)
             F-43 (T)                    I-54 (T)
             F-45 (T)                    J-4 (T)
             F-46 (T)                    J-6 (T)
             F-48 (F)                    J-13 (T)
             F-49 (T)                    J-33 (F)
             F-50 (T)
             F-54 (T)      I-13 (T)
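
Scoring with a key of this kind can be sketched as follows. The sketch assumes that the raw score is the number of keyed items answered opposite to the direction given by patients with unfavorable status, which is one reading of "a high score accompanied improvement." The names `UNFAVORABLE_KEY` and `evs_score` are hypothetical, only five of the items from Table 3 are used, and the patient record is invented for illustration.

```python
# Direction of response given by patients with unfavorable status, for a few
# items from Table 3 (True stands for "T", False for "F").
UNFAVORABLE_KEY = {
    "A-8": False,
    "F-11": False,
    "F-45": True,
    "I-35": True,
    "I-37": True,
}


def evs_score(responses, key=UNFAVORABLE_KEY):
    """Count keyed items answered opposite to the unfavorable direction.

    Items missing from the record are skipped. The scoring rule itself is an
    assumption made for this sketch.
    """
    return sum(
        1
        for item, unfavorable in key.items()
        if item in responses and responses[item] != unfavorable
    )


# Hypothetical post-shock record (not data from the study).
record = {"A-8": True, "F-11": True, "F-45": False, "I-35": True, "I-37": False}
score = evs_score(record)
print(score)
```

A record that matches the key on every item scores zero under this rule, and a record that opposes it on every item scores the full number of keyed items.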








EVS discriminates slightly better than the preliminary scale when applied back 
to the original criterion cases. No real cross validation was possible with the few 
remaining cases. It is reasonable to assume, however, that the validity of EVS is at 
least as great as that of the first form. Nevertheless, the value of EVS would seem to be
greater for the light it may throw on the process of improvement than for its
possible use as a clinical instrument.


2Copies of the scale may be obtained by request from the author. 





MARVIN J. FELDMAN 


ITEM OVERLAP WITH OTHER SCALES


The items on EVS overlap considerably with other MMPI scales, for as shown
in Table 4 there is: (a) considerable overlap with items on the neurotic scales, espec-
ially D, Hy, and Pt, (b) little overlap with items on several psychotic scales, espec-
ially Pa and Ma, and (c) little overlap with Ps but considerable overlap with the ego
strength scale. The overlap forms a high percentage of the total items on a few scales,
the highest being 46% with Pr. The items on the prognosis scale seem quite stable,
for they are not the ones which change with shock treatment. To study further the
relationship between Ps and EVS, scores on the two scales were correlated. On 63
cases, the pre-shock Ps scores correlated only .33 with the post-shock EVS scores.
Thus there is only a moderate but significant tendency for prognostic and evaluative
scores to agree. While favorable Ps and EVS scores seem to go together, variable
EVS scores tend to accompany poor Ps scores.
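
The statement that a correlation of .33 on 63 cases is significant can be checked with the usual t transformation of a product-moment correlation. The sketch below assumes that test; the article does not say how significance was assessed, and the function name is hypothetical.

```python
import math


def t_for_r(r, n):
    """t statistic for testing a product-moment correlation against zero."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r ** 2)


# r = .33 between pre-shock Ps and post-shock EVS scores, 63 cases.
t_value = t_for_r(0.33, 63)
print(round(t_value, 2))
```

With 61 degrees of freedom the two-tailed five percent critical value of t is about 2.0, so a t of this size is consistent with the description of the correlation as moderate but significant.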


TABLE 4. ITEM OVERLAP BETWEEN EVS AND OTHER MMPI SCALES


           Number      Common     Number scored        Number scored
Scale      of Items    Items      same by recovered    same by unimproved


15 0
64 0
30 6
33 0
60 0
60 2
50 2
60 0
40 0
48 0
78 0
46 2
52 0


Ps
Ego strength
scale





INTERPRETATION OF EVS 


The process of improvement may be elucidated by an analysis of the item con- 
tent on EVS compared with Ps. Before treatment, the items which distinguished 
good from poor prognosis pertained mainly to interpersonal relations. That is, before 
shock treatment the patient with a good prognosis perceived possibilities of good 
relations with the family, on the job, socially, etc., while the reverse held true for the 
patient with a poor prognosis. It has been suggested that the responses of the
unimproved indicate ego-alien externally directed aggression. The answers of the 
good prognostic bet seemed to belie his clinical status. By design the responses of all 
groups to EVS are quite similar before treatment. The items which change with 
shock treatment for the recovered but not the unimproved are, however, quite different 
from the content of Ps. Thus the items on EVS are mainly symptomatic in nature 
referring to bodily concerns, e.g., A-40, "I feel weak all over much of the time"; or
vague fears, e.g., G-2, "Several times a week I feel something dreadful is about to
happen"; or worry, anxiety, and tension, e.g., G-6, "I certainly have had more than
my share of things to worry about," F-45, "I brood a good deal," and I-35, "Life is a
strain for me much of the time"; and to personal inadequacy, e.g., I-37, "I certainly
feel useless at times." The few interpersonal items reflect a feeling of social isolation,
e.g., E-43, "When in a group of people, I have trouble thinking of the right things to
talk about." These items, then, largely refer to intra- rather than interpersonal
tensions. The striking content differences between Ps and EVS point up the in- 
tricacies in understanding the verbal behavior of clinical groups. 

In so far as these verbal self portraits mirror the clinical picture, we might sur- 
mise that shock treatment has mainly produced a symptom reduction; the improved 
patient is less tense and anxious, less preoccupied with bodily malfunctioning, less 
suspicious and somewhat more optimistic, and finally less withdrawn socially. Al-
though he seems to have a much more positive self-evaluation, there is little indica-
tion of better interpersonal relations. This portrait of the improved patient sug-
gests a symptomatic cure. Perhaps these results reflect the kind of change produced by
somatic treatment.

The unimproved cases tend to be "plus" getters, that is, they give statistically
infrequent responses both before and after treatment, while the recovered cases tend
to change. In this respect the high item overlap of EVS with Pr as well as the sharp
increase in K scores of recovered patients after treatment can be noted. From
these diverse bits of information, it is possible that EVS also in part taps the same
test taking attitude reflected in such scales as L, K, F, N, and Pr. Meehl sug-
gests that more than a mere test taking attitude is inherent in the answers to such
scales; there is some reason for believing that the adequacy of the patient’s ego
strength and defensive structure is involved. The present results lead the writer to
suggest that "plus" getting in a normal record may be a sign of realistic self criticism
and good ego strength while "plus" getting in an elevated profile may indicate hostil-
ity, resignation, and inadequate ego defenses. It is possible that EVS measures one
facet of ego strength, namely a decrease in internal tension reflecting a more positive 
self-evaluation. This factor may be a pre-condition for a fuller recovery but may or 
may not be accompanied by better interpersonal relations. 

Test scores on EVS are simple raw scores which in practical usage would be 
taken only from post-shock MMPI records. We assumed that high scores indicate 
changes in test behavior compared with pre-shock scores. As the scale is now used, 
the magnitude of the change does not affect our interpretation. It is quite possible 
that a ‘‘difference’’ score between pre- and post-shock scores on EVS might be more 
meaningful. However, difference scores based upon arithmetic differences between 
pre- and post-shock EVS scores actually turned out to be less discriminating than the 
results reported in this study. This finding is understandable in view of the fact that
pre-shock EVS scores are somewhat lower in the unimproved group than in the re-
covered group. It is possible that a more refined measure which takes into account the
initial discrepancy in level between the groups might produce a difference score with 
equal or greater discriminating power than the simple post-shock EVS score. I am 
attempting to work out the technical problems involved in this transformation. 
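
One conventional way to take the initial level into account is a residualized change score: post-shock scores are regressed on pre-shock scores and each patient's residual is used in place of the simple difference. The sketch below shows that idea only. It is an assumption offered for illustration, not the transformation the author had in mind, and the scores are invented.

```python
from statistics import mean


def residual_gains(pre, post):
    """Post-test scores with the part predictable from pre-test scores removed.

    Fits a least-squares line post = intercept + slope * pre and returns the
    residual for each case.
    """
    mean_pre, mean_post = mean(pre), mean(post)
    sxx = sum((x - mean_pre) ** 2 for x in pre)
    sxy = sum((x - mean_pre) * (y - mean_post) for x, y in zip(pre, post))
    slope = sxy / sxx
    intercept = mean_post - slope * mean_pre
    return [y - (intercept + slope * x) for x, y in zip(pre, post)]


# Hypothetical pre- and post-shock scores for six patients (not study data).
pre = [22, 30, 25, 35, 28, 18]
post = [50, 58, 31, 60, 29, 27]

simple_diff = [b - a for a, b in zip(pre, post)]
adjusted = residual_gains(pre, post)
print(simple_diff)
print([round(v, 1) for v in adjusted])
```

By construction the residuals average zero and are uncorrelated with the pre-shock scores, which is the property that a simple difference score lacks when groups start at different levels.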


SUMMARY AND CONCLUSIONS 


This study attempted to derive empirically a scale to evaluate response to shock 
therapy. 

The resultant scale, labeled EVS, can be considered moderately valid as a tool 
to evaluate shock therapy. Validity was impaired mainly because a sizable num- 
ber of unimproved patients obtained favorable test scores. 

Scores on EVS after treatment were positively correlated with pre-shock scores 
on a prognosis scale but only to a moderate degree. 

In the main, the content of items on EVS reflected symptoms and not inter- 
personal relations. This suggests that improvement with shock treatment is 
largely a process of symptom abatement with no real changes in interpersonal 
relations. 


A VALIDATION OF MEASURES THAT PREDICT THE
EFFICACY OF SHOCK THERAPY¹


DONALD K. PUMROY² AND WILLIAM S. KOGAN
University of Maryland          VA Hospital, Seattle, Washington


PROBLEM 


In recent years psychologists have become increasingly interested in predicting 
whether a mentally ill person will benefit from a particular type of treatment such 
as insulin and electroshock (EST). Several studies have used the Minnesota Multi- 
phasic Personality Inventory (MMPI) to predict the response to shock therapy. 
Feldman empirically derived a prognostic scale (Ps) for shock therapy from an
item analysis of the MMPI’s of two groups of Ss, one of which recovered and the
other of which remained unimproved as the result of shock treatment. This scale was
then found to discriminate between recovered and unimproved patients in an in-
dependent sample. Using the MMPI profile as a basis, Welsh developed an
anxiety index (AI) and an internalization ratio (IR). He found that the higher the 
AI or the IR, the greater the likelihood of patients benefiting from the different 
types of treatment studied. 

Pearson analyzed the pre-treatment MMPI’s of male patients given EST and
later rated by a psychiatrist as to their response to treatment, finding three profile
signs related to a poor response to EST, as follows: (a) the T score on the "D" scale was
greater than 84 or less than 55, (b) the T score on the "Sc" scale was greater than 84, and
(c) at least four of the nine clinical scales had T scores of 80 or above. He found that the
greater the number of these signs on a patient’s MMPI, the greater the likelihood 
of a poor response to EST. 
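
Pearson's three signs, as described above, amount to a simple count over a T-score profile. The sketch below is an illustration only: the function name and the example profile are hypothetical, and the list of nine clinical scales (Hs, D, Hy, Pd, Mf, Pa, Pt, Sc, Ma) is an assumption, since the text does not name them.

```python
# Assumed set of nine clinical scales; the text does not list them.
CLINICAL_SCALES = ("Hs", "D", "Hy", "Pd", "Mf", "Pa", "Pt", "Sc", "Ma")


def pearson_sign_count(t_scores):
    """Count how many of the three poor-response signs a profile shows."""
    signs = 0
    # Sign 1: D greater than 84 or less than 55.
    if t_scores["D"] > 84 or t_scores["D"] < 55:
        signs += 1
    # Sign 2: Sc greater than 84.
    if t_scores["Sc"] > 84:
        signs += 1
    # Sign 3: at least four clinical scales at 80 or above.
    if sum(1 for scale in CLINICAL_SCALES if t_scores[scale] >= 80) >= 4:
        signs += 1
    return signs


# Hypothetical profile (not a patient from the study).
profile = {"Hs": 70, "D": 90, "Hy": 82, "Pd": 60, "Mf": 50,
           "Pa": 81, "Pt": 85, "Sc": 88, "Ma": 55}
sign_count = pearson_sign_count(profile)
print(sign_count)
```

A higher count would be read, following Pearson, as a greater likelihood of a poor response to EST.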

The purpose of this study was to apply these measures of prognosis of EST to a 
new group of patients in order to provide additional data on the validity of the 
measures. 

PROCEDURE 

The subjects consisted of all testable patients from a VA general hospital who 
were assigned to EST between May 1953 and November 1954. During this period, 
77 patients were assigned EST, and 65 of these patients had ceased EST by the end 
of November 1954 receiving 879 EST’s or a mean of 13.5. Twenty-three patients 
were testable, all male veterans age 22 to 62, mean of 37.5. They received a total of 
331 EST’s with a mean of 14.4. 

Each subject was given the MMPI from 1 to 7 days prior to the start of his
EST series. The test was given with standard instructions except that the subject 
was told to answer all of the items. Four indices were ascertained: (a) Feldman’s 
Ps scale, (b) Welsh AI signs, (c) Welsh IR, and (d) Pearson’s three signs. These 
indices, of course, were determined only after two criterion measures had been ob- 
tained. The first and primary criterion was a rating of improved or unimproved 
made by the patient’s psychiatrist. This rating was made approximately one month 
after the patient terminated EST. The other criterion was the number of EST’s the 
patient received. As the policy of the hospital was to remove a patient from EST 
when he showed improvement, it was assumed that a smaller number of EST’s was 
related to improvement. 

RESULTS 

The psychiatrists rated 16 patients improved and 7 unimproved. These two 
groups are compared as to mean scores on the Ps, AI, and IR, and the number of 
Pearson’s signs in Table 1. 


TABLE 1. MEANS ON THE Ps, AI, IR AND PEARSON’S SIGNS


Factors           Ps        AI        IR       Signs

Unimproved       26        96.6     112.9      2.
Improved         24.3     102.8     119.6





¹This study was conducted at the VA Hospital, Seattle, Washington, with the cooperation of the
Department of Psychiatry, University of Washington.
²Formerly at University of Washington.

To check the significance of these differences, t tests were used. Although none were
significant, all of the measures except the Ps were in the direction predicted. Table 2 
indicates the correlation between the two criterion measures (psychiatric ratings 
and the number of EST’s) and the four prognostic measures. Product moment cor- 
relations were computed between the prognostic measures and the number of 
EST’s; biserial correlations were used with prognostic measures and the ratings. 
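
A biserial correlation between a continuous measure and a two-category rating can be sketched as below, using the standard textbook formula: the difference between the two group means, divided by the standard deviation of all scores, multiplied by pq/y, where p and q are the group proportions and y is the normal ordinate at the point dividing them. The function name and the data are hypothetical, and the use of the population standard deviation is an assumption, since textbooks differ on that detail.

```python
from statistics import NormalDist, mean, pstdev


def biserial_r(scores, improved):
    """Biserial correlation between scores and a dichotomous rating."""
    group_1 = [s for s, flag in zip(scores, improved) if flag]
    group_0 = [s for s, flag in zip(scores, improved) if not flag]
    p = len(group_1) / len(scores)
    q = 1 - p
    # Ordinate of the unit normal curve at the point cutting off proportion p.
    y = NormalDist().pdf(NormalDist().inv_cdf(p))
    return (mean(group_1) - mean(group_0)) / pstdev(scores) * (p * q / y)


# Hypothetical scores and ratings for eight patients (not study data).
scores = [1.2, 0.9, 1.3, 1.0, 1.1, 1.0, 0.8, 1.2]
improved = [True, True, True, True, True, False, False, False]

r_b = biserial_r(scores, improved)
print(round(r_b, 2))
```

The biserial coefficient treats the improved/unimproved rating as a cut on an underlying continuous variable, which is why it runs larger than the point-biserial coefficient computed from the same data.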

TABLE 2. CORRELATIONS BETWEEN PROGNOSTIC MEASURES,
NUMBER OF EST’S AND RATINGS


Factors          Number of EST’s          Ratings


-.04
-.21
-.42*

-.37


*Significant at the 5% level.


Only the correlation between IR and the number of EST’s was significant. This is 
in the hypothesized direction; the higher the IR the fewer the number of EST’s re- 
ceived. The two criterion measures were not highly correlated (r = .14). Actually, 
there was a slightly greater chance of being rated improved with an increase in the 
number of EST’s administered. 


DISCUSSION 


In general, the results of this study did not confirm the ability of these measures 
to predict the efficacy of EST. The reason for this lack of confirmation may lie in 
two areas. The first possibility concerns the sample used in studies of this nature. 
Probably of most importance is the selection of the individuals who are to receive 
EST. This is most commonly done by a psychiatrist, who selects certain patients for 
EST in terms of rather subjective criteria. In some cases the whole psychiatric staff 
has a certain unwritten policy as to which patient will receive EST. It seems quite 
possible, therefore, that the cases selected for EST at one hospital may be different 
from those at another hospital. When this takes place the measure to predict prog- 
nosis for EST devised at one hospital will be more likely to fail in another hospital. 
It seems important, therefore, to validate such a measure in several different settings 
and to objectify psychiatric indications for giving EST. In this study only 23 out of 
the 77 patients who were given EST were testable. Consequently, in extrapolating 
from these results, only one patient out of three who was to receive EST could be 
evaluated on these prognostic measures of response to treatment. Another possible 
explanation for the discrepancy may be in the criterion of improvement where raters 
may have had different frames of reference. There is a great need, in psychological 
research of this type, for a criterion that is objective, inexpensive and can be ob- 
tained shortly after treatment. 

SUMMARY

Four different MMPI measures (Feldman’s Ps, Welsh’s AI and IR, and Pear- 
son’s signs) purporting to predict the efficacy of EST were applied to a sample of 
mental patients. Twenty-three male veterans were given the MMPI before they re-
ceived EST. Two criteria of improvement were used: (a) ratings by psychiatrists,
and (b) number of EST’s received. Only the relationship between number of EST’s
and the IR was significant. In general, the measures were not able to predict the
efficacy of EST with this sample.


AN EVALUATION OF THE GRASSI TEST FOR
ORGANIC INVOLVEMENT 


MICHAEL HIRT 
University of Nebraska 


PROBLEM 

One of the outstanding symptoms of organic patients, depending upon the 
location and extent of the lesion, is the loss of ability to function at the abstract 
level. A normal subject is capable of utilizing both the concrete and abstract ap- 
proach in solving problems, whereas damaged or otherwise impaired subjects have 
difficulty with abstract responses which are among the first affected. It therefore has 
become necessary to develop sensitive psychological tests for detecting intellectual 
defects due to organic involvement. Yates has made an excellent evaluation of the
psychological tests of brain damage. 

The Grassi Block Substitution Test consists of a set of colored cubes similar
to the Kohs blocks. Each block has one side painted yellow, one white, one red, one 
blue, one red and white, and one yellow and blue, the latter two sides being equally 
divided diagonally. Each of the 24 cubes in the test is identical in size, color, and 
color relationships. It is important that all of the cubes be identical to the most 
minute detail, not only to allow the subject the possibility of completing all of the 
designs, but also to avoid confusing organic patients who are often unable to over- 
look slight color and hue differences. 


PROCEDURE 
The clinical subjects used in this study were selected from two Iowa state insti- 
tutions. The hospital staffs assumed complete responsibility for selecting subjects. 
The need for patients whose diagnosis was unequivocal was impressed upon them. 


Both institutions supplied patients whose clinical histories were sufficiently complete 
to warrant concluding the presence or absence of organic involvement. The organics 
were either post-lobotomy cases, syphilitics, grand mal epileptics with years of 
numerous daily seizures, or alcoholics of long duration. The functional group con- 
sisted almost entirely of schizophrenics. In selecting normal subjects, the only limit- 
ation imposed was the effort to use normals of comparable age to that of the patient 
population. The final investigation contained data from 120 cases, ranging in age 
from 16 to 78 years and representing intelligence levels from moron to very superior. 
Prior to testing the author had no knowledge of any patient’s diagnosis. 

In administering the Grassi test, the subject works with four cubes. With these 
he is to reproduce either parts of, or the complete, stimulus design. The Grassi test 
consists of five designs. Each design is reproduced at four levels of increasing com- 
plexity. The simple concrete level requires the subject to reproduce only the top of 
the stimulus design; the simple abstract level requires the subject to again reproduce 
only the top of the stimulus design, but using different colors. The complex concrete 
level requires the subject to reproduce the entire stimulus design; while the complex 
abstract level requires the subject to use different colors in reproducing the entire 
stimulus design. 

An effort was made to control the variables of age and sex. Practical considera- 
tions, however, made it impossible to match the functional, normal, and organic 
groups completely. Although the range of ages for the three groups was not identical, 
the mean ages were within two years of one another. The correlation between age 
and Grassi test score was not reliably significant; neither was the t test between the
mean scores for the male and female subjects. 


RESULTS 
Table 1 indicates the distribution of Grassi scores for the entire sample. Of the 
40 normal subjects, not one received a score below 20. Of the subjects with known 
organic pathology, not one scored above 16. 

TABLE 1. DISTRIBUTIONS AND PERCENTILE RANKS OF SCORES ON THE GRASSI
BLOCK SUBSTITUTION TEST FOR ORGANIC, SCHIZOPHRENIC AND NORMAL GROUPS

The normals made the greatest number of errors at the complex abstract level.
The same pattern of errors seems to repeat itself in the functional group, although 
there is also a marked increase in the number of errors made on the simple abstract 
level of the more complicated designs. The deterioration accompanying brain 
damage is exemplified to its fullest extent in the organic group. Abstract behavior, 
which is one of the first functions affected, is most noticeably absent in the perform- 
ance of organic subjects. Accordingly, they committed the largest number of errors 
at the simple and complex abstract levels. 

Since the Grassi test measures the ability to function at the abstract level, and 
since the ability to abstract is influenced by intelligence, the author considered it 
pertinent to make some effort at equating the functional, organic, and normal ele- 
ments in regard to this factor. Accordingly, measures of intelligence were obtained 
for all the subjects, and these were correlated with Grassi test scores. A coefficient 
of .484 was obtained. 

Grassi, using the test-retest method, reported a reliability coefficient of
.85 for the Grassi test. Since the validity of the Grassi test was found to be so high, 
it was considered adequate to accept the reliability reported by Grassi. 


DISCUSSION

A subject’s diagnosis should be made on the basis of an analysis of his test be- 
havior and intellectual level. Disregarding any one of these factors will tend to 
lower the reliability and validity of the diagnosis. If a subject is classified on the 
basis of test score alone, the clinician might easily be misled as to the patient’s 
actual level of performance. For example, a score of 18 would not be considered in- 
dicative of impairment in an individual of borderline intelligence. On the other hand, 
a score of 18, if obtained by a subject of superior intelligence, would be strongly sug- 
gestive of moderate impairment. Also worthy of consideration is the fact that a 
score of 12 for a subject of superior intelligence is indicative of more extensive or-
ganic deterioration than the same score for a dull subject.

Grassi has suggested a score of 20 as the cutting point between organics and 
non-organics. The results of this study, however, tend to indicate a score of 16 as 
more appropriate. 
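
The practical effect of moving the cutting point from 20 to 16 can be illustrated with a short sketch. The function name and the scores are hypothetical, and the rule that a score at or below the cutting point is flagged is an assumption, since the text does not say how a score exactly on the cutting point is treated.

```python
def flag_organic(score, cutoff):
    """Return True when a Grassi score falls at or below the cutting point."""
    return score <= cutoff


# Hypothetical scores for six functional (schizophrenic) subjects,
# chosen only to illustrate the comparison; not data from the study.
functional_scores = [17, 18, 19, 22, 24, 25]

flagged_at_20 = sum(flag_organic(s, 20) for s in functional_scores)
flagged_at_16 = sum(flag_organic(s, 16) for s in functional_scores)
print(flagged_at_20, flagged_at_16)
```

In this made-up sample half of the functional subjects are flagged with the cutting point at 20 and none with it at 16, which is the kind of difference that motivates the lower cutting point.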


SUMMARY 

One hundred twenty subjects, equally divided into organic, functional, and 
normal groups, were tested with the Grassi Block Substitution Test. All of the nor- 
mal subjects scored above 20 and all of the organics fell below this score, suggested by 
Grassi as the cutting point between normal and brain damaged subjects. Although 
all of the functional subjects scored higher than the organics, approximately half of
them fell below 20, which, according to Grassi, would suggest moderate organic im-
pairment. The results of this study support the general conclusion that the Grassi
test should take its place among other diagnostic instruments. 


AN EVALUATION OF THE “ORGANIC SIGNS” IN THE H-T-P DRAWINGS*
VYTAUTAS J. BIELIAUSKAS AND SANDRA L. KIRKHAM¹
Richmond Professional Institute 


INTRODUCTION 


Conspicuous changes in the performance of organic brain injured cases have 
been noticed since time immemorial by popular observers and literary writers. 
Authorities on aphasia such as Goldstein, Head, and Riese have reported
specific changes in verbal communication. In full agreement with these authors,
Jaspers went a step further, suggesting a definite change in all the performance
areas of a person with cortical disturbances. Basing his hypothesis upon various re-
ports of experimental observers, Jaspers suggested that drawings by or-
ganic persons must show specific characteristics, i.e., omissions, exaggerations, and
signs of loss of Gestalt. The research and theories advanced by more recent authors
primarily concerned with the diagnostic aspects of drawings imply that
there are some characteristics differentiating the drawings of organic patients
from those of normals.

Buck and Jolles have advanced hypotheses concerning specific signs sug-
gesting organicity in the House-Tree-Person (later referred to as H-T-P) drawings.
Some aspects of these hypotheses have been investigated by Michal-Smith,
whose research was unable to give definitive support to Buck’s and Jolles’ theories, 
but gave some tentative indications that the trends of the findings tended to support
Buck’s assumptions. Therefore, the present study was designed (1) to restudy some
findings of Michal-Smith in order to reach more conclusive evidence as to the validity
of H-T-P “organic signs”, and (2) to test additional “organic signs.”


*Grateful appreciation is expressed herewith to Dr. W. I. Prichard, Superintendent, and Miss
Hannah S. Davis, Chief Psychologist, Lynchburg Training School and Hospital, Colony, Virginia, for
their assistance in securing the sample for this study, and to Dr. Nicholas M. Vincent for his advice
on statistics.

¹The co-author is now with the Department of Psychiatry and Neurology, The Bowman Gray
School of Medicine, Wake Forest College, Winston-Salem, North Carolina.


SUBJECTS 


Forty subjects were selected from the population of the Lynchburg Training 
School and Hospital. The staff diagnosis made on the basis of social history, medical 
history, and psychological evaluation was used as criterion for distinction between 
organic and non-organic subjects. Only cases diagnosed as having some definite 
structural damage of the brain tissue were included in the organic group. The non- 
organic group was selected from those classified as either familial mental deficiency 
or without mental deficiency. All whose diagnoses included suggestions of other
disorders, i.e., psychosis, neurosis, or character disorder, were excluded from the
sample. Because of rigid selection, only twenty subjects were obtained who could
be classified as organic. Twenty non-organic subjects were used in the control group.
The groups were matched for sex, age, months in institution when tested, form of
Wechsler given, and IQ. The results of the matching are shown in Table 1.


TABLE 1. MATCHING OF ORGANICS AND NON-ORGANICS ON SEX, AGE WHEN
TESTED, LENGTH IN INSTITUTION, FORM OF WECHSLER GIVEN, AND IQ SCORES








Matching Variables Organics Non-Organics 





Sex 
Males 12 
Females 8 
Age when tested 
Range 14 - 33 
Mean 22 
Standard deviation 6.6 
Months in institution when tested 
Range 1 - 193 
Mean 62.95 
Standard deviation 45.7 
Form of Wechsler given 
Form I 14 13 
0 1 
5 5 
, 1 1 
Verbal IQ 
Range 51-101 52 - 100 
Mean 68.1 67.7 
Standard deviation 11.8 11.6 
Performance IQ 
Range 37 - 104 42 - 102 
Mean 63.9 64.0 
Standard deviation 18.2 15.2 
Full Scale IQ 
Range 45 - 94 45 - 88 
Mean 62.9 63.5 
Standard deviation 14.3 12.7 





The H-T-P drawings of each subject were scored according to eighteen objective
criteria suggested by Jolles and Buck, as shown in Table 2. Only signs which
could be evaluated (measured) objectively were included. All other signs mentioned 
by Buck and Jolles which in any way required subjective interpretation on the part 
of the scorer were excluded. 

A Chi-square and a Phi coefficient were computed for each item to determine if
there were any significant differences. Pearson’s r’s were obtained from the Phi
coefficients to account for possible continuity of the variables. A t test was run
to determine the difference between the two groups’ mean scores on the objective
criteria.
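
The item-level computation described above can be sketched as follows. This is a minimal illustration and not the authors' own procedure: it assumes an uncorrected Chi-square on a 2 x 2 table (group by presence of the sign), the function name `chi_square_phi` is hypothetical, and the counts in the example are invented rather than taken from Table 3. The conversion of Phi to Pearson's r is not shown, since the source does not state the formula used.

```python
import math

def chi_square_phi(a, b, c, d):
    """Chi-square (1 df, no continuity correction) and signed Phi for a 2 x 2 table.

    a = organic subjects showing the sign,     b = organic subjects not showing it
    c = non-organic subjects showing the sign, d = non-organic subjects not showing it
    """
    n = a + b + c + d
    margins = (a + b) * (c + d) * (a + c) * (b + d)
    if margins == 0:
        # A zero marginal total (e.g. nobody shows the sign) leaves Phi undefined;
        # return zeros so the item is treated as non-discriminating.
        return 0.0, 0.0
    phi = (a * d - b * c) / math.sqrt(margins)
    chi2 = n * phi ** 2  # identity that holds for any 2 x 2 table
    return chi2, phi

# Hypothetical item: 10 of 20 organics and 5 of 20 non-organics show the sign.
chi2, phi = chi_square_phi(10, 10, 5, 15)

# Tabled critical value of Chi-square at the 5 per cent level with 1 df.
CRITICAL_5_PERCENT_1_DF = 3.841
significant = chi2 > CRITICAL_5_PERCENT_1_DF
```

With this sign convention, Phi is positive when the sign is more frequent among the organics and negative when it is more frequent among the non-organics.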
TABLE 2. EIGHTEEN OBJECTIVE ITEMS INDICATING ORGANICITY IN H-T-P DRAWINGS FROM
BUCK AND JOLLES

House

1. Details, irrelevant: horizontal line separating first and second stories.
2. Foundations: pillars abnormally high (height one-fourth height of house or
   higher).
3. Steps leading to blank wall.
4. Walls, disconnected.
5. “Facade” drawn.
6. House drawn as toppling (angle of 75 degrees or less, measuring from base line
   of house to axis of paper).
7. Omits one or more of the “essential” details. Essential details for house are a
   roof, a wall, a door, a window, and a chimney.

Tree

8. Trunk, one-dimensional, with one-dimensional branches that do not form a
   system.
9. Dangling tree roots (roots above the ground line).
10. One or two dimensional trunk and one or two dimensional branches some of
    which are not attached to trunk.
11. Tree drawn as falling (angle from trunk to base of paper less than 75 degrees).
12. Omits one or more of the “essential” details. Essential details for tree are a
    trunk and one branch.

Person

13. Head, large in relation to trunk (H:T less than 1:1).
14. Fail to join . . . arm or legs to trunk.
15. Full face, except perhaps, for the head.
16. One-dimensional neck.
17. Person drawn as falling (angle of trunk to base of paper less than 75 degrees).
18. Omits one or more of the “essential” details. Essential details for a person are
    a head, two eyes, two ears, a nose, and a mouth, and a trunk with two arms and
    two legs.



RESULTS AND DISCUSSION


The Chi-squares, Phi coefficients, Pearsonian r’s, and t tests are presented in
Table 3. None of the eighteen items shows a Chi-square significant at the 5 per cent
level. No significance can be deduced on the basis of either the Phi coefficients or the
converted Pearsonian r’s. Neither does the Phi coefficient for both groups indicate
differences sufficiently significant for individual prediction. Likewise, the t test
values are not significant.


TABLE 3. NUMBER OF ORGANIC AND NON-ORGANIC SUBJECTS SHOWING THE
CHARACTERISTICS OF THE 18 OBJECTIVE CRITERIA OF ORGANICITY WITH
CHI-SQUARE, PHI COEFFICIENT, PEARSON r, AND t TEST

Item No.     Organic     Non-Organic     χ²     φ     r

[Table body illegible in the source. Surviving entries: .150, .250, .226, .226, .050, .022, .55]





Since the statistical values obtained on the objective scoring criteria do not
show significant findings, it appears that the signs postulated by Buck and Jolles as
discriminators of organic brain damage and used in this study were inadequate to
discriminate between the drawings of the two groups in this population. The highest
correlation was negative (-.42) on the item referring to failure in joining of the arms
or legs to the trunk. The drawing of the person as toppling was also negatively
correlated (-.40) with organicity, while the item dealing with drawing of a person in
full face, except for head, was positively correlated (.40) with organicity. Since all of
these correlations are far from significant for individual predictions, and only very few
approach the values required for group prediction, it seems that the items examined in
this study are not valid discriminators of organicity.

There are two objections which could be raised against this conclusion. First, 
it may be said that most mental defectives might be conceived as organics, and since 
our study included some subjects classified as familial mental deficiency, the com- 
parison would not tend to distinguish adequately between the two groups. Should 
the idea that most mental defectives are also organics prove to be correct, then it 
would seem that there would be a trend indicating higher frequency of items ap- 
pearing in the drawings of both groups. However, such a trend cannot be perceived 
in the results obtained. Only two items exceeded half the possible score for each 
group (items 5 and 18), while sixteen other items fell considerably below. 

Second, the smallness of the sample is well recognized. However, it should be 
pointed out that twenty subjects was the maximum sample which could be obtained 
from a population of 2300, in applying the rigid criteria of matching and diagnosis. 
Therefore, the sample could be considered as highly representative. Of course, it 
would be desirable to repeat this study on a larger sample, if such a sample meeting 
all criteria could be obtained. 

Nevertheless, it seems reasonable to assume that at least a cautious
attitude is indicated in using the so-called “organic signs” in H-T-P drawings as
organicity discriminators, as long as the support for their validity rests on such
tenuous grounds as the Michal-Smith study and the present findings. A recent study
by Silverstein and Robinson of drawings of poliomyelitic children, done for another
purpose but using H-T-P criteria, did not find any signs occurring with significant
frequency, giving again some support to our findings with adults.


SUMMARY AND CONCLUSIONS 


Sets of H-T-P drawings were obtained from forty white institutionalized sub- 
jects including 20 organic and 20 non-organic cases. The subjects were matched for 
sex, age, months in institution when tested, form of Wechsler given, and the IQ’s. 
The drawings were examined for 18 objective criteria claimed to be indicative of 
organicity. Chi-squares, Phi coefficients, and Pearsonian r’s were computed on each
item and a t test on the difference between the mean scores of the groups was run. None
of the statistical results showed significant differences. Therefore, it was concluded 
that the eighteen criteria claimed to be indicative of organicity did not adequately 
discriminate either individually or together the organics in the population of this 
study. Although a larger sample may give more conclusive evidence, it was sug- 
gested that extreme caution be exercised in using these signs as organicity indicators 
in H-T-P drawings. 
HUMAN FIGURE DRAWINGS BY YOUNGER AND OLDER ADULTS


IRVING LORGE, JACOB TUCKMAN AND MICHAEL B. DUNN 
Teachers College, Columbia University 


PROBLEM 


Dunn has demonstrated that drawings of the self by boys and girls, 4 to 18
years of age, show a progression in maturity of representation, from an undiffer- 
entiated drawing at age four or five, to a well-integrated human figure at age fourteen. 
Drawings of post adolescents seem to be at the same level as those of 14 year olds. 
Dunn suggested that human figure drawings may be used as evidence of psychic 
intactness of the self in that the drawings are positively correlated with age, grade, 
intelligence and aspects of adjustment. The aspect of adjustment appraised was the 
distinction between the drawings of deviants (feeble-minded, neurotic, psychotic, 
and brain damaged) and those of presumed normals. The drawings of deviants were 
found significantly less mature than those of normals of the same chronological age. 

Presumably, if people suffer physiological and psychological deficits as they 
grow older, such changes may be reflected in their drawings of the self. The purpose 
of this study, therefore, is to test the hypothesis that the drawings of older people
show increasing loss of intactness. Drawings of sophisticated younger adults where 
physiological and psychological deficit should be relatively minor will be compared 
with those of older adults where the probability of such deficit should be greater. 


SAMPLE AND METHOD 


The younger adults were seventy-five students (average age 32) enrolled in a 
graduate course on the psychology of the adult; the older adults (average age 73) 
were 104 individuals, some living in an institution for the aged and some participating 
in recreational day centers for old people in New York City. 

The instructions were: “Draw a picture of yourself, standing, facing front view as if you
were looking at yourself in a full-length mirror. We are not the least bit concerned with
whether or not you can draw like an artist. We are not measuring art or art ability. Don’t
think too much about it, but just draw. Be at ease and whatever you draw will be
satisfactory.”


For the younger adults and for most of the older adults, the drawings were completed 
in group sessions. The younger adults were given a time limit of six minutes; the 
older subjects had no time limit. Just a few needed more than fifteen minutes to 
complete the drawing. 
RESULTS


The first appraisal of the drawings rated them on the 20-point scales, one for
boys and the other for girls, made by Lorge and Dunn to evaluate holistically the
maturity of the representation in drawings by children and adolescents. The mean
scores for the drawings by graduate students and by older subjects differ sharply.
For younger adults, the mean scale
value for men is 14.1, and for women, 14.0, which correspond to the point at which a 
leveling takes place among children and adolescents. For older adults, the mean 
scale value for men is 7.7, which corresponds to the average drawings by nine-year 
old boys; for women, 7.3, which corresponds to the average drawings by seven and 
one-half year old girls. 

The method of appraisal by the children’s scale fails to give consideration to the 
qualitative differences between children’s drawings and those by younger and older 
adults. For instance, in contrast to the drawings of children, those of graduate 
students are of “more mature” persons, e.g., the human figure looks fuller, the face
looks older, wrinkles show, clothes are more adult, glasses are shown, etc. The quali- 
tative differences between children’s drawings and those by older adults are even 
greater. When a recognizable human figure is drawn by an older adult, the drawing 
looks like a person beyond the age of 50, i.e., the face looks older, more wrinkles are
shown, etc. 

Superficially, some aspects of the drawings of older adults and of very young 
children are similar. The drawing of a four year old, as may be expected, is so im- 
mature that the sex of the respondent cannot be determined. Of the drawings by 
older men and women, 24 percent are as poor or poorer than those of the average four 
year old, and two-thirds of these cannot be identified for sex of the respondent. Yet, 
these adult drawings of the self do not look like those drawn by children. 

Since the Lorge-Dunn scale was not a wholly satisfactory way of appraising the 
drawings by adults, seven criteria were developed for comparing the drawings of 
younger and older adults: (1) identification of the sex of the respondent, (2) intact- 
ness: no omission of a major part of the body, e.g., legs, arms, hands, feet or torso, 
(3) integration, i.e., evidence of body gestalt, (4) depth (tridimensionality or the
impression of solidity) rather than flatness, (5) good proportion of the body parts, 
(6) absence of bizarreness, and (7) evidence of adequate motor coordination. 

Appraising the adult drawings by these seven “either/or” criteria gave evidence
of significant differences between the younger and the older adults. For younger
adults, the sex of every respondent could be identified; for the older adults, sex
could not be identified for 18 percent and was identified incorrectly for 13 percent.
For the younger adults, 85 percent of the drawings were intact compared
with 25 percent for the older adults. Moreover, there were qualitative differences 
between young and old. The younger adults omitted such parts of the body as hands 
or feet, but the older adults omitted not only hands and feet, but also arms, legs, and 
torso. In a number of cases, the drawings by older adults were just shells, i.e., just a
head, supported by legs with the entire torso missing. For the younger adults, all 
drawings gave evidence of a body gestalt, compared with 45 percent of older adults. 
For the younger adults, 77 percent of the drawings gave evidence of depth of the 
human figure, compared with 18 percent of the older adults. For the younger adults, 
97 percent of the drawings showed good proportion of the body parts, but only 45 
percent of the drawings of older adults did so. For the younger adults, 97 percent of 
the drawings were not bizarre, compared with 62 percent for the older adults. For 
the younger adults, no drawing suggested evidence of poor motor coordination, but
13 percent of the drawings of older adults suggested such evidence.

Undoubtedly, the different criteria are interrelated, i.e., an intact drawing 
tended to show body gestalt, good proportion of the body parts, depth, absence of 
bizarreness, adequate motor coordination, and made possible estimates of the sex of 
the subject, while a drawing lacking intactness tended to show inadequate gestalt, 
poor proportionality, etc. Using a numerical weight of one for each of the seven
criteria, two-thirds of the drawings by younger adults received a maximum score of
seven; none, a score of less than five. By contrast, only one percent of the drawings
by older adults received a score of seven and 58 percent a score of zero to four.
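
The unit-weight scoring just described can be sketched as below. This is an illustrative sketch only: the criterion labels, the function names `criteria_score` and `percent_with_score`, and the example drawing are all hypothetical, and the source does not describe how the ratings were recorded.

```python
# The seven "either/or" criteria, each worth one point when met.
# The short labels are hypothetical names for the criteria listed in the text.
CRITERIA = (
    "sex_identifiable",
    "intact",
    "integrated",
    "depth",
    "good_proportion",
    "not_bizarre",
    "adequate_motor_coordination",
)

def criteria_score(ratings):
    """Sum the seven criteria, one point for each criterion rated as met."""
    missing = [name for name in CRITERIA if name not in ratings]
    if missing:
        raise ValueError(f"unrated criteria: {missing}")
    return sum(1 for name in CRITERIA if ratings[name])

def percent_with_score(scores, low, high):
    """Percentage of drawings whose score lies between low and high inclusive."""
    if not scores:
        return 0.0
    hits = sum(1 for s in scores if low <= s <= high)
    return 100.0 * hits / len(scores)

# Hypothetical drawing that meets every criterion except depth.
example = {name: True for name in CRITERIA}
example["depth"] = False
example_score = criteria_score(example)
```

A call such as `percent_with_score(scores, 0, 4)` would then give the kind of figure quoted for the older adults, i.e., the share of drawings scoring zero to four.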

The two methods of appraisal overlap insofar as the Lorge-Dunn scale gave 
holistic consideration to the seven criteria. The correlation between the seven cri- 
teria score and the scale value by the Lorge-Dunn scale is .75 for older women, .93 for 
older men, .25 for younger women, and .45 for younger men. The correlations for 
older men and women and younger men are significantly different from zero. The 
correlation between score for criteria and scale value underestimates the relation for 
younger adults because of the marked restriction in range, i.e., two-thirds were rated
favorably on every criterion. The relationship between the two appraisals tends to
be closer for men than for women, although the difference is significant only for the older adults.
Although the relation between the two appraisals is close, separate scales for younger 
and for older adults, similar to the Lorge-Dunn approach for children, seem desirable 
to account for the qualitative differences in adult drawings.


DISCUSSION 


The sample of younger adults and of older adults cannot be considered to be 
representative either of younger adults or of older adults in the population. Many of 
the differences found in this study may be a function of the method of getting the 
cooperation of the individuals who participated in the experiment, as well as where 
they happened to be when such requests were made. At best, the evidence does 
suggest that older adults, whether in homes for the aged or ambulatory participants 
in a recreational program for the aged, do make drawings of the self which differ 
markedly from those of graduate students and from those of children in general. While it is
true that the scale values established on children’s drawings and the
criteria of proportionality, intactness, and so on, do correlate significantly, the correlations
suggest that something more, or rather, less, is in the drawings of older adults.

For younger adults, however, the drawings are so homogeneous, in general, both 
with reference to scale value and with reference to the criteria, that the data must be
considered as evidence that there is a leveling off in adolescence which is
maintained up to the later 30’s. The use of drawings of the self as a further part of
the evaluation of the adult personality should lead to a significant contribution to the
understanding of the aging process.


SUMMARY 


A comparison of the drawings of the self by graduate students (average age 32) 
with those by institutionalized and noninstitutionalized older adults (average age 73) 
supports the hypothesis that the drawings of older people show increasing loss of 
intactness. In contrast to the human figure drawings by younger adults, the draw- 
ings of older people are characterized by incompleteness, lack of integration, flatness, 
lack of proportion, bizarreness, evidence of inadequate motor coordination, and 
difficulty in identifying the sex of the respondent. In contrast to the drawings by old- 
er adults, those by younger adults are quite homogeneous. It must be recognized 
that the samples in this study are not representative of younger adults or of older 
adults in the population. 
MANIFEST CONTENT AND INTERPRETIVE MEANING OF
VERBAL INTELLIGENCE TEST RESPONSES 


NORMAN TALLENT 
Veterans Administration Center, Kecoughtan, Virginia 


PROBLEM 


How valid is content interpretation of responses to verbal intelligence test 
items as an indicator of overt behavioral trends? This study attempts to validate 
Rapaport’s finding that subjects who indicate they would yell “fire” in response to
Wechsler’s Comprehension item 2 (the “theatre item”) show impulsive trends in
their everyday behavior. In addition, other responses which appeared to suggest 
that the subject is impulsive, or that he shows tendencies to impulsive behavior, 
were examined against evaluations of behavioral control. 


PROCEDURE 

The sample consisted of 198 boys who constituted the ninth grade of a public 
school system. Their behavioral control was appraised in terms of averaged ratings 
by their teachers on the Behavioral Control Rating Schedule (BCRS), which provides
ratings of behavioral control in terms of a number of postulated component
variables. An attempt was made to secure five ratings for each subject, but practical
considerations reduced the number obtained to an average of 4.33. On the basis of
these averaged scores on the BCRS, 120 boys taken from the larger group were assigned
to one of three experimental groups of 40 each, labelled “Well-Controlled
Group,” “Impulsive Group,” and “Average Group.” All of these subjects were then
tested with the Wechsler-Bellevue Intelligence Scale, Form I.


RESULTS AND DISCUSSION

A number of the responses given might be indicative of tendencies to impulsive
behavior in accordance with the hypothesis that those who say they would yell
“fire” are impulsive. Three subjects of the Well-Controlled group and two
subjects of the Average group gave such a response. None of the Impulsive group
responded in this fashion. The hypothesis that subjects who say they would yell
“fire” are impulsive is thus unsupported when impulsivity is defined in terms of low
behavioral control as rated on the BCRS.

In addition to testing this hypothesis, it seemed worthwhile to determine if 
any classification of responses could be established which might be indicative of be- 
havioral undercontrol. Thus all responses which mentioned some form of overt im- 
pulsive action were tabulated. Similarly, responses which suggested that the subject 
might have impulsive tendencies were listed. Altogether, five categories of responses 
having a possible relationship to behavioral control were identified as follows: 

1. Yell “fire.” Also shout, scream, etc.
2. Letting others know (except manager or ushers), or warning people, but no
   specific mention of yelling.
3. Other overt impulsive actions, e.g., running for fire extinguisher, the manager, etc.
4. Denial of yelling “fire,” possibly because of consequences.
5. Mention of concern over panic, stampede, riot, danger, alarming people, etc.


The rationale for this classification of possible indicators of impulsivity would 
probably be acceptable to most clinical psychologists. Warning others of the fire 
seems to be an unthinking response, perhaps almost on a par with yelling out “fire.”
Category 3, “other overt impulsive actions,” includes such behavior as running,
which could precipitate panic. Category 4, “denial of yelling ‘fire,’” is included to
check a variation of the main hypothesis; that is, the responses included in this category
might indicate impulsive tendencies which are under control. If so, the question
is raised as to which of the experimental groups favors such a response. Being concerned
with panic, Category 5, might be regarded similarly.

To determine whether there is a relationship between the sort of test responses
listed above and behavioral control, all responses of the subjects were classified by
two psychologists as belonging to one or more of the above five categories, or
into a sixth category which included all responses which could not be classified into
any of the above.¹ (The two psychologists showed complete agreement in classification,
with the exception of one point of difference which was readily resolved.)
The categories were then further combined, as follows. Categories 1, 2, and 3 were
combined on the basis of including responses which mention overt impulsive actions,
and categories 4 and 5 formed a new composite category to include responses which
might be indicative of impulsive trends. The group of responses which could not be
included in either of the above then formed another category. Using a chi square
test of independence, no relationship was found between these categories and level
of behavioral control (.20 < P < .30). It thus appears that considerable caution
is needed in predicting actual behavior from verbal responses to hypothetical problems.
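
A chi square test of independence of the kind reported here could be computed along the following lines. This is a sketch under stated assumptions, not a reconstruction of the author's analysis: the function name `chi_square_independence` is hypothetical, and the counts are invented for illustration, since the source reports only the probability range and not the cell frequencies.

```python
def chi_square_independence(table):
    """Pearson chi-square and degrees of freedom for an r x c table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    if 0 in row_totals or 0 in col_totals:
        raise ValueError("every row and column must contain at least one case")
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df

# Hypothetical counts (not the study's data).
# Rows: overt impulsive action, possible impulsive trend, unclassified.
# Columns: Well-Controlled, Average, Impulsive groups.
hypothetical = [
    [12, 10, 8],
    [9, 11, 10],
    [19, 19, 22],
]
chi2, df = chi_square_independence(hypothetical)

# Tabled critical value of chi square at the 5 per cent level with 4 df.
CRITICAL_5_PERCENT_4_DF = 9.488
related = chi2 > CRITICAL_5_PERCENT_4_DF
```

With three response categories and three groups the test has 4 degrees of freedom, so a value below the tabled critical value leaves the hypothesis of independence standing.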

SUMMARY 


The presence of a possible relationship between certain intelligence test responses
suggesting that the subject is impulsive and ratings of behavioral control was
investigated. The findings were negative. Considerable caution is needed in predicting
actual behavior from verbal responses to hypothetical problems.
RORSCHACH M RESPONSES AND INTELLIGENCE
ROBERT SOMMER 
Larned State Hospital 


PROBLEM 

The relationship between Rorschach human movement responses and intelli- 
gence has been the subject of considerable research. Positive and significant correla- 
tions have consistently been found, attesting to the genuineness of the relationship 
(1, 2, 3). However, the meaning of this correlation is open to question, as Rorschach
M is well confounded with Rorschach H (human content). It can be asked
whether it is the use of human content rather than the attribution of movement that 
relates to intelligence. Previous investigators have also found that M correlates 
highly with number of responses, Rorschach R, and that R correlates significantly 
with intelligence. If this is the case there exists the question of whether the cor- 
relation between M and IQ is an artifact of the correlations of R with IQ and of M 
with R. There is also the problem of whether M responses from subjects of varying 
IQ are distinguishable on bases other than quantity. 


PROCEDURE 


From the files of the Psychology Department of Southeast Louisiana Hospital, 
the records were selected of all patients for whom Rorschach protocols and Wechsler- 
Bellevue records were available.¹ This yielded a population of 123 cases, 77 males and
46 females, with a mean age of 32.2, and from all diagnostic categories. 

In the second stage of the study, from the files of the Psychology Departments 
of Southeast Louisiana Hospital and Gulfport (Miss.) V. A. Hospital, the records 
were selected of male patients who had been given both the Wechsler-Bellevue 
verbal scale and the Rorschach, and fitted into any one of nine categories based on 
number of M responses (2, 3, or 4) and IQ (ca. 80, ca. 100, ca. 120). This provided a
nine-celled matrix with each IQ level and number of M responses represented. Every 
effort was made to secure representative IQ estimates by eliminating subjects whose 
Wechsler disclosed considerable scatter. The M responses were typed on cards and 
placed in groups of three depending upon the number of M responses. 

Three groups of six judges each were utilized: senior psychologists, each with a 
Ph.D. in clinical psychology; psychology interns; hospital secretaries. Each judge 
was asked to rate three series of cards: (a) the verbatim M responses and inquiries
as found in the original protocols; (b) another set of cards (different responses) from
which all obvious cues of vocabulary and grammar had been removed without the judges’
knowledge, e.g., the response “two harlequins in a dance” was changed to “two clowns in a
dance” and the response “two guys is running” was made to read “two men are
running”; and (c) a third series (again different responses) from which all cues were removed
except the card to which the response had been given, the type of movement, and the
gender of the person moving. If the gender was unknown (e.g., clowns, elves), the
male gender was used.

For the verbatim and language equated series, the senior psychologists and in- 
terns were told that these were M responses from Ss at IQ levels 80, 100, and 120, 
and that they were to rank the three cards in each group in terms of the probable IQ 
of the respondent. The secretaries were told simply that these were statements made 
by people and they should rank them from high to low in terms of what they would 
estimate the person’s IQ to be. With the “verb only” series, all groups were told in
what ways the responses had been impoverished. 

As it was desirable to have each judge rate responses at each level of response 
structure yet not feasible to have him rate the same responses at more than one level, 
three replications of the design were carried out, each with a different set of responses. 
This permitted each judge to rank a verbatim series, a (different) language equated 
series, and a (different) “verb only” series. The order in which the series were given
to the judge was counterbalanced. 
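
One way such counterbalancing could be arranged is sketched below. The source does not say how the orders were assigned; the sketch assumes, purely for illustration, that each of the six judges in a group received a different one of the six possible orders of the three series. The function names and judge labels are hypothetical.

```python
from itertools import permutations

SERIES = ("verbatim", "language equated", "verb only")

def counterbalanced_orders(series=SERIES):
    """All possible presentation orders of the series (3! = 6 for three series)."""
    return list(permutations(series))

def assign_orders(judges, series=SERIES):
    """Give each judge one order, cycling through the orders if judges outnumber them."""
    orders = counterbalanced_orders(series)
    return {judge: orders[i % len(orders)] for i, judge in enumerate(judges)}

# Six hypothetical judges forming one group (e.g. the interns).
judges = [f"judge_{k}" for k in range(1, 7)]
schedule = assign_orders(judges)

# Under full counterbalancing each series occupies each position equally often.
first_position_counts = {
    s: sum(1 for order in schedule.values() if order[0] == s) for s in SERIES
}
```

With six judges and six orders, every series appears first, second, and third exactly twice within a group, so order effects are balanced across the three levels of response structure.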


RESULTS 


The first correlation made was between number of human movement responses 
and verbal IQ. This proved to be .371 (p < .01)² and confirmed the relationship
found in previous studies. The correlation between number of human responses and
IQ is .330 (p < .01). In Table 1, R is found to correlate .291 with IQ (p < .01).

These overlapping relationships necessitate the use of partial correlation techniques
to assess the relationship between M and IQ with the other variables held constant. These coefficients
also appear in Table 1 and are quite informative. With the effects of R taken into 
account, both M and H continue to correlate significantly with IQ. Also of interest 
is the disentangling of the H component from the M score. When H is held constant, 
we find M just barely correlating with IQ, while when M is held constant, H loses 
any relationship to IQ. 
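
The partial correlations discussed above follow the standard first-order formula, r(xy.z) = (r(xy) - r(xz) r(yz)) / sqrt((1 - r(xz)^2)(1 - r(yz)^2)). The sketch below applies it to the M, H, and IQ coefficients. The M with IQ and H with IQ values are those given in the text; the value .857 is assumed here to be the M with H intercorrelation of Table 1, an assignment the flattened table does not state explicitly. The function name `partial_r` is hypothetical.

```python
import math

def partial_r(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

r_m_iq = 0.371  # M with IQ, as reported in the text
r_h_iq = 0.330  # H with IQ, as reported in the text
r_m_h = 0.857   # ASSUMPTION: taken to be the M with H entry of Table 1

# M with IQ, holding H constant (x = M, y = IQ, z = H).
m_iq_given_h = partial_r(r_m_iq, r_m_h, r_h_iq)

# H with IQ, holding M constant (x = H, y = IQ, z = M).
h_iq_given_m = partial_r(r_h_iq, r_m_h, r_m_iq)
```

Under that assumption the two results round to .181 and .025, which agrees with the statement that M retains a small relationship to IQ when H is held constant, while H retains essentially none when M is held constant.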

With the verbatim M responses, all three groups of judges scored significantly
above chance expectancy in their IQ rankings (probable errors of 4.05, 4.91, and 2.89
respectively). With the language-equated series, the interns and the senior psychologists
reached the .05 probability level, while the secretaries were somewhat below
chance expectancy. With the “verb only” series, none of the groups scored significantly
above chance expectancy.

¹The wisdom of using a “randomly selected” psychiatric population can be questioned. At most,
however, this fact serves to limit generalization of the results to other patient populations.
²All correlations are Pearson r’s.

TABLE 1. CORRELATIONS AND PARTIAL CORRELATIONS
BETWEEN M, H, R, AND IQ

Variables            Partialed
Correlated           Out              r

M and IQ                              .371
H and IQ                              .330
R and IQ                              .291
M and H                               .857
M and R                               .905
H and R                               .497
M and IQ             R                .271
M and IQ             H                .181
R and IQ             M                .130
R and IQ             H                .155
H and IQ             R                .223
H and IQ             M                .025


A summary of the overall analysis of variance of the data is presented in Table 2. 
It shows that the correctness of judgments does not vary with the number of M 
responses per subject. The differences in performance between senior psychologists, 
interns, and secretaries did not attain statistical significance. The only significant 
variable is response structure (whether the cues of vocabulary and grammar are 
deleted or not). Subsequent t tests disclosed that the rankings of the verbatim
material were significantly better than rankings on either the “language equated”
or the “verb only” series. The difference between the latter two series is not
significant.

TABLE 2. ANALYSIS OF VARIANCE OF CORRECT RANKINGS

Source*                                   SS       df      MS        F       p

Training level
  (intern, secretary, etc.)              6.31      2      3.16      2.21
Number of M Responses                    4.79      2      2.40      2.07
Structure (verbatim,
  language equated, etc.)               24.98      2     12.49     10.76

*None of the interactions were statistically significant.
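
The arithmetic linking the columns of Table 2 can be checked with a short sketch. Each mean square is the sum of squares divided by its degrees of freedom, and each F is the effect mean square divided by an error mean square. The error term is not printed in the table, so the sketch recovers it from the Structure row; this rests on the assumption, not stated in the source, that all effects were tested against the same error term. The function names are hypothetical.

```python
def mean_square(ss, df):
    """Mean square: sum of squares divided by degrees of freedom."""
    return ss / df

def f_ratio(ms_effect, ms_error):
    """F ratio: effect mean square divided by error mean square."""
    return ms_effect / ms_error

# Structure row of Table 2: SS = 24.98, df = 2, F = 10.76.
ms_structure = mean_square(24.98, 2)

# ASSUMPTION: a single error mean square, implied by the Structure row.
implied_ms_error = ms_structure / 10.76

# Number of M Responses row: MS = 2.40; F recomputed from the implied error term.
f_number_of_m = f_ratio(2.40, implied_ms_error)
```

Under that assumption the recomputed F for Number of M Responses rounds to 2.07, the value printed in the table.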


DISCUSSION 


The correlations add further evidence for the genuineness of the relationship 
between Rorschach M and intelligence. They also show that the correlation is not 
simply an artifact of the relationships between H and IQ or R and IQ. The rankings 
as to IQ attest to the importance of cues such as grammar and vocabulary level in 
estimations of intelligence. When presented with verbatim M responses, all groups 
scored significantly above chance in their rankings on intelligence. When the cues of 
vocabulary and poor grammar were removed, the psychologists still exceeded chance 
expectancy while the secretaries did not. This indicates that there are cues to intelli- 
gence in Rorschach M responses other than vocabulary level and grammar. The 
fact that none of the groups exceeded chance expectancy with the “verb only” series
tells us that these cues are not the type of movement involved or the card to which 
the response is given. This still leaves the response phraseology (other than good or 
poor grammar) and the subject’s elaboration of his response. 

Although the present study has shown that the responses of subjects at different 
IQ levels can be distinguished by using cues of vocabulary and grammar, it leaves 
unanswered the specific question of whether “movement quality” varies in some way
with intelligence. To many clinicians, the subject of IQ 70 or 80 who gives four or 
even six M responses has always been an enigma. Further direct comparison of these 
M responses with those from subjects at other intelligence levels should help provide 
some of the answers. 
SUMMARY


To determine whether the relationship between M and IQ would appear with a 
psychiatric population and with the effects of R and H held constant, correlations 
and partial correlations were made between scores on the Wechsler Verbal Scale and 
number of Rorschach M responses for 123 psychiatric patients. The correlation be- 
tween M and IQ was supported with both H and R held constant. 

To determine whether M responses of Ss at different IQ levels are qualitatively 
different, M responses from psychiatric patients at IQ levels of 80, 100, and 120 were 
ranked by three groups of judges (senior psychologists, interns, and secretaries) as to 
intelligence. In one series of responses, all obvious cues of vocabulary level and poor 
grammar were removed. The results disclosed that for the verbatim M responses all 
groups of judges were able to exceed chance expectancy. When the obvious cues 
were removed, only the psychologists were able to exceed chance expectancy. 
GLOBAL EVALUATION OF RORSCHACH PERFORMANCE VERSUS 
SCORES: SEX DIFFERENCES IN RORSCHACH PERFORMANCE* 


T. W. RICHARDS AND DAVID C. MURRAY 
Louisiana State University Gulfport Division, VA Center, Biloxi, Miss. 


PROBLEM 

Studies of the Rorschach test and its utility in discriminating between various 
types of personalities have shown that the customary formal scores are of little use. 
Knopf, for example, finds that patients classified as schizophrenic, neurotic, or 
psychopathic personalities are very similar in their production of location, determin- 
ants, etc. In the present study, the aim was to determine whether important differ- 
ences between personalities may be elicited with the Rorschach test by means of 
clues other than the conventional scores. Felzer, reviewing the literature, shows 
that there is little if any difference between males and females when usual scoring 
categories are used for comparison. We asked ourselves this question: using the 
rather simple criterion of belonging to one sex or the other, is it possible for a judge to 
distinguish males from females, utilizing any signs or impressions he wishes to use? 


PROCEDURE 

The records of thirty subjects were given (without sex identification) to each of 
several judges, who were asked simply to sort the records as masculine or feminine. 
These thirty records were selected from protocols obtained by the junior author 
from one hundred and two students in a course in elementary child psychology at 
Northwestern University, twenty-four men and seventy-eight women. On a random 
basis, fifteen records of each sex were selected for the sorting procedure of this study. 
All references in the records that might identify the sex of the testee were eliminated. 
Nine judges* were used; each was given the thirty protocols with a summary 
profile of Beck scores. Each was told that the subjects were college students enrolled 
in a course in child psychology, and that half the records were given by males and 
half by females. He was asked merely to indicate which records were those of men 
and which those of women. He was told that he could use any procedure at all in 
making his decision. 

*We wish to express our thanks for their cooperation to Thomas Atkins, Herdis Deabler, Arvord 
Belden, Bernard Eingold, Miles Hardy, Lillian Hopkins, Elizabeth Price, and Norma Schlicht. 

Since the psychologist’s experience is an important variable, judges were selected 
from three levels of sophistication and experience with the Rorschach test: three 
were Ph.D. clinical psychologists, three were trainees working for the Ph.D., and 
three were non-psychologists (two nurses and a secretary). 

RESULTS 
Differences between sexes in objective Rorschach scores. Chi square and t for differences 
between means were used to test significance of differences between men 
and women for the following objective scores: M, FM, m, F, FC, CF, C, Ft, Y + YF + FY, 
V + FV, Sum C, R, H, Hd, H + Hd, A, Ad, A + Ad, Abst., An., Anthrop, 
Art, Bot, Clo, Fire, Food, Household, Landscape, Personal. Of these twenty-nine 
variables, only two differentiated between the sexes at a level better than 
chance (Table 1). 
TABLE 1. RORSCHACH FACTORS DIFFERENTIATING MALES AND FEMALES AT 
BETTER THAN CHANCE LEVELS 


Men    Women    Probability Value 








3.1 4.7 
9 


.02 


Median 2 
Mean 15.9 13.5 
Median 14 12 .05-.10 


It may be concluded that there are no significant differences between the sexes in 
terms of conventional scoring categories. Thus, our groups are similar in this respect 
to those of Felzer and others. 


“Global” judgments of sex identification. The expected frequency of identification 
for each cell in a fourfold table by chance alone would be 7.5 (7 or 8); that is, half the 
men would be designated as male on the basis of chance alone. The number of judges 
matching a given sex at various levels of success, together with the (Yates) corrected 
and uncorrected P values of χ² for such a fourfold table, are shown in Table 2. 
TABLE 2. SUCCESS OF “GLOBAL” JUDGMENTS OF SEX IDENTIFICATION 

Correct Match of    Number of            Probability 
One Sex             Judges       Yates corrected    Uncorrected 


8 or less                                           Near .70 
9                                .30 - .50          .20 - .30 
10                               .10 - .20          .05 - .10 
11                               .02 - .05          .01 - .02 
12                               <.01               <.01 


The mean value of correct matchings was 9.44 out of 15 possible; corrected and 
uncorrected P values for 9.44 are about .30 and .10-.20, respectively. When only the 
three trained psychologists are used, the mean value of correct matchings was 11.33. 
P values, corrected and uncorrected, for this degree of success were <.01 in each instance. 
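
The probability bands reported for the fourfold table can be checked with a short calculation. The sketch below is illustrative only: it assumes (the report does not say so explicitly) that each judge called exactly fifteen records male and fifteen female, so that k correct matches for one sex gives the table [[k, 15-k], [15-k, k]]. The function names are hypothetical. For one degree of freedom, the upper-tail probability of chi square x is erfc(sqrt(x/2)).

```python
import math


def fourfold_chi_square(k, n_per_sex=15, yates=False):
    """Chi square (1 df) for k correct matches of one sex.

    Assumption: the judge labels exactly n_per_sex records as male, so the
    fourfold table is [[k, n-k], [n-k, k]] and every margin equals n_per_sex.
    """
    n = n_per_sex
    total = 2 * n
    diff = abs(k * k - (n - k) * (n - k))      # |ad - bc|
    if yates:
        diff = max(diff - total / 2, 0.0)      # Yates continuity correction
    return total * diff ** 2 / n ** 4          # N(ad - bc)^2 / product of margins


def p_value_1df(chi_square):
    """Upper-tail probability of chi square with one degree of freedom."""
    return math.erfc(math.sqrt(chi_square / 2))


if __name__ == "__main__":
    for k in (8, 9, 10, 11, 12):
        corrected = p_value_1df(fourfold_chi_square(k, yates=True))
        uncorrected = p_value_1df(fourfold_chi_square(k))
        print(f"{k:>2} correct: Yates p = {corrected:.3f}, uncorrected p = {uncorrected:.3f}")
```

Under these assumptions the function also accepts a fractional k, so the mean of 9.44 correct matchings can be entered directly; it yields roughly .29 (corrected) and .16 (uncorrected), in line with the values quoted above.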


Individual differences among subjects in their evaluation. The mean number of judges 
who gave proper-sex evaluations for each sex was of course identical; of nine judges 
in each case, 5.7 judged appropriately. There was a tendency for men subjects to 
spread over a greater range (two men were considered male by all judges, one man 
by only one judge; no women were considered female by all judges, while one woman 
was considered male by only one evaluator). The standard deviations of numbers of 
judges making proper evaluations for males and females were 2.25 and 1.93 respectively. 
More men were judged unanimously to be masculine than were women 
judged to be feminine. 
Qualitative impressions of variant records. A qualitative, impressionistic examination 
of the eleven records identified most incorrectly and the twelve records most correctly 
identified suggests certain consistent patterns. Two patterns were found among 
the “incorrectly identified records” (at least 5 of the 9 judges identified them 
incorrectly). One of these, which was found in two of the female records, consisted of 
an isolated reference to a feminine activity which could have led to correct identification: 
one subject spoke of a detail as looking like “the pole I dance around every 
night . . . just because we do this dance,” and the other said a detail looked like 
“sea horses . . . I cut out about 55 for a party.” This isolated feminine percept was 
minimized by most judges, possibly because of greater emphasis on masculine percepts, 
such as anatomical references, having taken biology, an airplane, percepts 
concerned with violence or death, or a politically oriented percept. The other pattern, 
found in all six of the most incorrectly identified male records and three incorrectly 
identified female records, consisted in what might be called “ambisexual” responses. 

A number of percepts and verbalizations suggested femininity, and a balancing 
number suggested masculinity. Male indicators include masculine vocabulary and 
ways of talking (“that louses up the story”), technical interests in anatomy, sexual 
references, references to war, etc. Female indicators include artistic interests, attention 
to clothing, literary interests, household decorations, interest in dancing, 
interest in children and childish things (“all primped up”, “fluffy”). 

Among the twelve most frequently identified male and female records (at least 
seven of the nine judges identified them correctly), there were several different pat- 
terns. The most frequent one, found in three female and three male records, consists 
of an overwhelming number of either male or female indicators, with few or none sug- 
gestive of the opposite sex. One female record and one male record had little evi- 
dence in the proper direction, but even less, or no, evidence in the wrong direction, 
so were usually judged correctly. In one male record, a very suggestive response 
(of wandering in the swamp when younger) apparently was so associated with male- 
ness by the raters as to allow them to identify the record correctly despite some 
rather feminine percepts. The final three records which were mostly identified cor- 
rectly seem to be rather evenly balanced between male and female indicators; they 
may well represent lucky guesses by some of the judges. 

Since it is an observable fact that some men have more masculine character- 
istics than others and some women appear to be more feminine than others, it is 
expected that some of the errors in identification reflect real variations in the femin- 
inity or masculinity of the subjects. Thus it would seem a plausible hypothesis that 
the one man whose sex was misjudged by eight of the nine judges was relatively 
feminine and that the one woman whose sex was misjudged by seven of the nine 
judges was relatively masculine. Also it may be assumed that the two men correctly 
identified by all nine judges were masculine and the three women correctly identified 
by all but one of the nine judges were relatively feminine. Unfortunately, there is 
not sufficient information available on these subjects to test this hypothesis. 

It may be noted that the selection of the subjects, college students in a class in 
child psychology, serves to decrease somewhat the differences between sexes which 
might be revealed by the Rorschach test. Thus a particular interest in children and 
the interests of children, which might be considered feminine, may be a poor indica- 
tor when all of the male subjects have elected to take a course in child psychology. 
Female college students frequently get into classes in physics, chemistry, biology, 
zoology, etc., and pick up some familiarity and perhaps even some interest in such 
things. Thus their records in that respect become more like those of the males. In 
general, college men might be expected to have a greater interest than men generally 
in art, literature, reading, etc. Thus their Rorschach responses might have more 
feminine indicators than would be otherwise expected. 


Judgment of sex according to training in psychology. When the correct matchings for 
each sex were evaluated according to the level of training of the judges, it was found 
that the three trained clinical psychologists correctly identified a mean of 11.33 out 
of 15 possible (p < .01, corrected for continuity). The partially trained psychologists 
correctly identified a mean of 9 out of 15 possible (not significant), and the untrained 
judges also correctly identified a mean of 9 out of 15 possible. This would suggest at 
least a small superiority as a result of training. However, it was noticed that some 
of those who did particularly well had reported taking a rather long time in making 
their judgments. Therefore, each judge was asked to give a rough approximation of 
how long he had taken to read the records and make his judgments. A rank order 
correlation was then computed between the judge’s number of correct matchings and 
his own rough estimate of the time it took him to make the matchings. The resulting 
rank order correlation was .938. This suggests that willingness to take time to work 
carefully through the records was more important by far than level of training. 
Furthermore, it suggests that Rorschach records do contain indications of the sex 
of the testee, but that these indications often are not obvious to a casual or hurried 
observer. At our present level of experience with the Rorschach such judgments 
take considerable time and thought if any high level of accuracy is to be achieved. 
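
A rank order correlation of the kind described above can be sketched as follows. The individual judges' scores and time estimates are not reported, so the figures in the usage note are invented purely for illustration; the function names are likewise hypothetical. The sketch ranks both variables (tied values receive the mean of their ranks) and then correlates the ranks, which agrees with the familiar 1 - 6Σd²/(n(n² - 1)) formula when there are no ties.

```python
import math


def average_ranks(values):
    """Rank values from 1..n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for idx in order[i:j + 1]:
            ranks[idx] = mean_rank
        i = j + 1
    return ranks


def rank_order_correlation(x, y):
    """Spearman rank order correlation: Pearson r computed on the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    # Hypothetical values only: NOT the judges' actual scores or times.
    correct_matchings = [7, 9, 10, 12, 8]
    minutes_taken = [10, 25, 20, 60, 15]
    print(rank_order_correlation(correct_matchings, minutes_taken))
```

With only nine judges, as in the present study, a coefficient of this kind rests on very few pairs, which is one reason to read the .938 as suggestive rather than as a precise estimate.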


Relationship between evaluation of men as masculine and Rorschach “signs” of homosexuality. 
One theory of a tendency to behave like the opposite sex is that the subject 
in this way is homosexual. We wanted to test this hypothesis, and decided, 
therefore, to correlate the frequency with which subjects (men only) were judged 
to be feminine with the number of Wheeler signs they exhibited. The fifteen men gave an 
average of 2.3 signs, comparable with the mean for the twenty “normal” male 
students examined by Davids, Joelson, and McArthur. 

The rank difference correlation rho was found to be .14 between the number of 
times a man’s record was judged female and the number of Wheeler signs. 

An appraisal of Wheeler’s signs, particularly as they are presented by Davids 
et al., suggests that these signs are deviations from the normal masculine response 
not so much in the direction of femininity as toward autism; there are contaminations, 
confabulations, drop in quality of form level, mood qualities, etc., similar to 
psychotic responses. 

Considering these few Northwestern men students to be a valid sample, it seems 
appropriate to suggest that apparent femininity in men is not a regression to a less 
mature sexuality. It is a genuine extratension and femaleness of a healthy sort. 
Homosexuality in men is not femininity as much as it is a reversion or regression 
or arrest at an immature level of sexual development. 


SUMMARY AND CONCLUSIONS 


Men and women do not differ significantly in their Rorschach performance as 
scored formally in the usual way. When records of Rorschach performance are 
judged as masculine or feminine on the basis of global impression, better than 
chance sortings can be made by trained psychologists, particularly if adequate time 
is given to making the judgments. There is no relationship between the tendency for 
men’s records to be judged feminine and the abundance of Wheeler ‘‘signs’’ of homo- 
sexuality. 

One conclusion from this study is that research with the Rorschach test that 
limits itself to objective scores (such as, location, determinants, etc.) fails to capital- 
ize on aspects of Rorschach performance most meaningful in the study of personality. 
UTILIZATION OF O’REILLY’S OBJECTIVE RORSCHACH AS A 
SCREENING TEST FOR STATE COLONY JOB APPLICANTS 


MARY MARTHA MURPHY 
State Colony, Woodbine, N. J. 


INTRODUCTION 

O’Reilly’s (3) modification of the Rorschach Test was devised to differentiate 
among normals, neurotics and psychotics with emphasis upon speed and objectivity 
of scoring. A scale was set up with twelve possible choices for each card. Four typically 
psychotic, four typically neurotic and four typically normal responses were 
selected for each card and the experimental subjects were requested to choose two 
answers for each inkblot. Scores were assigned to each type of response and cut-off 
points were established for the purpose of differentiating between (a) normals and 
neurotics, and (b) psychotics and neurotics. O’Reilly’s experimental group of 50 
normals was composed of nurses, stenographers, clerks and administrative staff 
members of the Saskatchewan Hospital, North Battleford, chosen at random from 
an occupational listing. His neurotic group consisted of 30 recently admitted patients 
who had been diagnosed psychoneurotics of various types by a staff conference 
composed of medical superintendent, clinical director and nine staff psychiatrists. 
His psychotic group consisted of 80 patients who had been diagnosed as 
psychotic by staff conferences. 

The present study reports on results of experimentation with O’Reilly’s Objective 
Rorschach as a screening test for job applicants at the State Colony at Woodbine, 
N. J. Previous investigations have disclosed little evidence of the practicability 
of other objectively scored and speedily administered modifications of the 
Rorschach Test. Smith and George (4) found that correlations between the scoring 
categories of the Harrower Multiple Choice Rorschach and the individual Rorschach 
were very low. Cronbach (2) reported that the Group Rorschach, objectively scored, 
failed to predict scholastic success of 200 college students and gave only small 
correlations with other criteria of the social and emotional adjustment of the sample 
group. Aldrich and Coffin (1) reported that Harrower-Erickson Multiple-Choice 
Rorschach ratings of 212 enlistees in the Navy, Marine Corps and Coast Guard 
failed to disclose it as a practicable method for detecting potentially psychotic 
individuals. 


METHOD 


The Objective Rorschach was individually administered to 60 applicants for 
employment at Woodbine Colony between March 1, 1956 and September 1, 1956. 
Investigation of records on March 1, 1957 disclosed that 30 of the original group of 
job applicants had remained at the Colony as satisfactory employees. Objective 
Rorschach scores of the original 60 job applicants and the 30 remaining satisfactory 
employees were compared. Scores of the 30 employees were compared with scores 
obtained by O’Reilly’s normal and neurotic groups and the statistical significance of 
the differences in scores was ascertained. 


RESULTS 

The original group of 60 job applicants was composed of 33 males and 27 females. 
Fourteen of the 30 satisfactory employees were males and 16 were females. The 
average age of both job applicants and satisfactory employees was 38 with a range of 
18 to 57. The average educational level of both groups was 10 grades with a range of 
5 to 17 years of formal educational background. Twenty-three of the 30 satisfactory 
employees were cottage attendants and 7 were professional and clerical workers. 
Twenty-six (87%) were white and four (13%) were Negro. The mean weighted 
Wechsler-Bellevue I Vocabulary Scale score of both groups was 10, which is the 
average verbal subtest score for Wechsler’s 20-29 year age groups (5). Weighted 
Vocabulary Scores of the 30 satisfactory employees ranged from 8 to 17, which 
suggested that verbal intelligence of the group was distributed between low average and 
superior levels. 

Comparison of Objective Rorschach scores of the 60 job applicants with those of 
the remaining 30 satisfactory employees disclosed little difference in distribution and 
no statistically significant difference in means. Table 1 indicates that the mean 
score (32.8) of the Woodbine employee group was almost identical with O’Reilly’s 
cut-off point for normals (33). There was a greater degree of scatter of scores around 
the mean for the Woodbine employees than for O’Reilly’s normal and neurotic 
groups. Scores of 23.2 per cent of the Woodbine employees ranked in O’Reilly’s 
neurotic category (34 to 37) and 16.5 per cent fell within O’Reilly’s psychotic limits 
(over 37). The difference between mean scores of the Woodbine group and O’Reilly’s 
normals was significant beyond the .01 level. The mean score of the Woodbine group 
was not significantly different from O’Reilly’s neurotic mean at the 5 per cent level. 
These results suggest that the Objective Rorschach is impractical as a screening 
technique for unselected institutional job applicants and appear to support 
conclusions reported in previous investigations as to the practicability of other 
objectively scored and speedily administered modifications of the Rorschach Test as a 
screening technique. 


TABLE 1. COMPARISON OF OBJECTIVE RORSCHACH SCORES OF STATE COLONY EMPLOYEE GROUP 
WITH SCORES OF NORTH BATTLEFORD NORMAL AND NEUROTIC GROUPS 

                North Battleford       Woodbine                North Battleford 
                normals (n = 50)       employees (n = 30)      neurotics (n = 30) 

Mean score                             32.8                    34.4** 
S. D.           2.56                   4.85                    2.1 

*Difference significant at .0009 level **Difference significant at .10 level 
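
The comparison of the Woodbine and neurotic means can be approximated from the summary figures in Table 1. The report does not say which significance test was used, so the following is only a sketch under an assumed method: an unpooled standard error with a large-sample normal approximation. The function name is hypothetical, and the inputs are the means and standard deviations as read from Table 1 (Woodbine 32.8 and 4.85, neurotics 34.4 and 2.1, with 30 cases in each group).

```python
import math


def mean_difference_test(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Return (z, two-sided p) for the difference between two group means.

    Assumed method (not stated in the report): unpooled standard error
    and a normal approximation to the sampling distribution.
    """
    standard_error = math.sqrt(sd_a ** 2 / n_a + sd_b ** 2 / n_b)
    z = (mean_a - mean_b) / standard_error
    p_two_sided = math.erfc(abs(z) / math.sqrt(2))
    return z, p_two_sided


if __name__ == "__main__":
    # Neurotic group versus Woodbine employees, values as read from Table 1.
    z, p = mean_difference_test(34.4, 2.1, 30, 32.8, 4.85, 30)
    print(f"z = {z:.2f}, two-sided p = {p:.3f}")
```

On these assumptions the ratio comes to about 1.66 with a two-sided probability near .10, which agrees with the table footnote and with the statement that the difference is not significant at the 5 per cent level.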


SUMMARY 


O’Reilly’s Objective Rorschach was administered experimentally as a screening 
test to 60 job applicants at a state colony. Investigation of records six months later 
disclosed that 30 of the 60 job applicants had remained as satisfactory employees. 
Comparison of the employee scores with O’Reilly’s norms revealed a statistically 
significant difference between mean scores of the Colony employees and O’Reilly’s 
normals. The mean score of the Colony group was not significantly different from 
O’Reilly’s mean for neurotic subjects. Scores of 39.7 per cent of the Colony employees 
ranked in O’Reilly’s neurotic and psychotic categories. This was interpreted as 
suggesting that the Objective Rorschach is impractical as a screening technique for 
unselected institutional job applicants. 


REFERENCES 


1. ALDRICH, C. K. and COFFIN, M. Clinical studies of psychoses in the Navy. I. Prediction 
values of social histories and the Harrower-Erickson Test. J. nerv. ment. Dis., 1948, 108, 36-44. 

2. CRONBACH, L. J. Studies of the Group Rorschach in relation to success in the college of the 
University of Chicago. J. educ. Psychol., 1950, 41, 65-82. 

3. O’REILLY, P. O. The Objective Rorschach: a suggested modification of Rorschach technique. 
J. clin. Psychol., 1956, 12, 27-31. 

4. SMITH, S. and GEORGE, C. E. The Harrower Multiple-Choice Rorschach: a critique. J. proj. 
Tech., 1954, 18, 507-509. 

5. WECHSLER, D. Measurement of adult intelligence. Baltimore: Williams and Wilkins, 1944. 


MOVEMENT IN THE FANTASY OF BRAIN-INJURED 
(CEREBRAL PALSY) CHILDREN 


T. W. RICHARDS* 


Louisiana State University 


PROBLEM AND PROCEDURE 

The purpose of this study is to determine whether children brain injured at birth 
differ from control children in their perception and projection of movement in 
response to a visual stimulus. 

Thirty-two children brain-injured at birth, members of a cerebral palsy group 
studied in New Orleans, were matched for age, sex and IQ with control subjects. 
The age range was from 4.5 to 19.5 years; mean IQ for the brain injured was 97.0, 
for the controls 97.6. Details of the matching and other factors of selection are given 
in a previous report. 

The stimulus used was the Projective Movement Sequence (PMS) developed 
by Lundin. This consists of a motion picture of eight separate scenes of movement 
of iron filings over a plane surface. Following each scene the subject is asked 
“what happened?” Reactions to the movement sequences were scored for (a) 
source of energy (external, internal, disintegrative, static), (b) direction of move- 
ment (extensor, flexor, diverse), and (c) content (human, animal, botanical, physiolo- 
gical, nature, etc.). Reliability of scoring of the PMS was determined by duplicate 
scoring by two observers of twelve of the sixty-four records. They agreed perfectly 
on 83% of one hundred and twenty responses. 


RESULTS 
In Table 1 are shown means, medians and sigmas for each of the groups of 
brain-injured and control children, together with probability data for the significance 
of differences. 

*The author is indebted to Sara W. Hooper, M.A., for assistance in testing, and to Dr. Frank J. 
Loeffler for help in scoring and statistical analysis. 

TABLE 1. MEANS, MEDIANS AND SIGMA FOR VARIOUS SCORING CATEGORIES OF PMS FOR BRAIN-INJURED 
AND CONTROL SUBJECTS, TOGETHER WITH DEGREE OF SIGNIFICANCE OF DIFFERENCES 
BETWEEN THEM. 

Comparison of the PMS scores with Rorschach scores for the same children 
showed that whereas all subjects gave some PMS movement, twenty-eight percent 
of all subjects gave no movement with the Rorschach. Correlation of movement 
given by PMS and Rorschach was .25 for brain-injured and .30 for control subjects. 


CONCLUSION 


A general finding of this study of differences between children brain-injured at 
birth and controls is that with a motion-picture stimulus of movement the brain- 
injured children were as spontaneous as controls. 

Like the uninjured child the brain-injured subject tends to identify motion as 
that of animals, humans, plants and objects. He differs from the uninjured in show- 
ing total avoidance of movement as a physiological phenomenon, and in using move- 
ment that is self-initiated and extensor rather than flexor and initiated by an out- 
side source. The brain-injured child, in his fantasy of activity, is much like the un- 
injured child, but he shows a significant tendency to disavow the physiological 
nature of activity, and to experience it as outgoing and active rather than passive. 
In his fantasy he denies that movement is a matter of his own physiology; rather, 
movement is an aggressive experience. This substantiates a theory that fantasy 
serves as wish fulfillment, or at least, compensation for motor handicap. 
DIFFERENTIATING ALCOHOLICS FROM NORMALS AND 
ABNORMALS WITH THE MMPI¹ 


DONALD P. HOYT AND GORDON M. SEDLACEK 
Kansas State College Mental Health Center, Cedar Rapids, Iowa 


PROBLEM 

A number of studies purporting either to describe the personality dynamics of 
alcoholics or to predict chronic alcoholism as a behavioral characteristic have 
appeared in the literature. In general these studies have been seriously 
limited by utilizing small, non-random samples, by studying only members of 
Alcoholics Anonymous (probably an unrepresentative group of alcoholics), and by 
neglecting to differentiate alcoholics from other clinical groups. The present study 
has as its purpose the identification of personality characteristics of alcoholics which 
distinguish them from non-alcoholics and from other clinical groups. 


SAMPLES 


Various samples were used in different parts of this study. These samples are 
listed below, together with an abbreviation code used in the remainder of this report. 


A. Alcoholic subjects. Two separate samples were used. Both met the following criteria: (a) 
white males; (b) fifth grade education or its equivalent; (c) diagnosis of chronic alcoholism by a 
physician and/or a clinical psychologist. 
    1. The first sample (A-1) consisted of the first 98 cases at the Mental Health Institute 
    in Independence, Iowa,² who met the criteria described above. 
    2. Sample 2 (A-2) was composed of 79 cases from the Willmar State Hospital, Willmar, 
    Minnesota,³ who met these criteria. 

B. Normal subjects. Three samples of subjects with no known personality disturbance were used 
in various phases of this study. 
    1. The first sample (N-1) was a group of 50 male V. A. on-the-farm trainees, utilized 
    in an unpublished master’s thesis by Marsh. 
    2. Sample 2 (N-2) consisted of the 258 normal Minnesota males selected by Hathaway 
    for research on profile patterns of the MMPI. 
    3. The final normal sample (N-3) consisted of the 139 normal Minnesota males and 54 
    V. A. hospital males used by Hathaway and McKinley in the development of their original 
    scales. 

C. Clinical subjects. Four samples of subjects who had been diagnosed in various clinical groups 
were utilized. 
    1. Sample 1 (C-1) was composed of 33 consecutive numbered cases from the Mental 
    Hygiene Clinic at Fort Snelling, Minnesota.⁴ In each case the primary diagnostic impression 
    was psychoneurosis. 
    2. The second sample (C-2) was composed of the 24 male “moderate psychoneurotics” 
    described by Gough. 
    3. Sample 3 (C-3) consisted of Gough’s 22 male psychotic patients. 
    4. The final clinical sample (C-4) was Hathaway’s group of 710 males with various 
    psychiatric diagnoses selected for research on MMPI profiles. 


PROCEDURE 
Three methods of accomplishing the major objective of the study were employed, 
all involving the use of the MMPI records of the various subjects. These methods 
were (a) item analysis, (b) mean profile comparisons, and (c) pattern analysis. 
Profiles were discarded if the ? or L score was as high as 70 or if the F-K raw score 
was as large as 9. 


¹This report is based on the M.S. thesis of the junior author written under the supervision of the 
senior author. 

²The cooperation of Dr. Donald Iyer in making these records available is gratefully acknowledged. 

³These records were made available by Mr. Warren D. Roberts of the Willmar State Hospital, 
Willmar, Minnesota. 

⁴Dr. Carl Jenness and Dr. Daniel Wiener generously provided these records. 


Item Analysis. In this method, the percentage of “plus responses” for each of the 
566 MMPI items was determined for the A-1 sample. These percentages were compared 
with those for the N-3 sample. Zubin’s nomographs were used for determining 
the significance of the difference. In order to maximize practical as well as 
statistical significance, two criteria for item selection were defined. First of all, only 
those items where the percentage difference was significant beyond the .01 level were 
included. Secondly, of this group, only those where the percentage difference reached 
or exceeded 15 per cent were selected. 
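
The two selection criteria can be expressed as a small screening function. This is a sketch under stated assumptions, not the procedure used in the study: a pooled two-proportion z test stands in for Zubin's nomographs, the default sample sizes (98 for A-1 and 139 + 54 = 193 for N-3) are taken from the sample descriptions and ignore any profiles discarded for invalidity, and the function names and the percentages in the usage lines are hypothetical.

```python
import math


def two_proportion_p(p1, n1, p2, n2):
    """Two-sided p for a difference between two independent proportions.

    Pooled z test, used here as a stand-in for Zubin's nomographs.
    """
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    standard_error = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    if standard_error == 0:
        return 1.0  # both groups answered identically (all plus or all minus)
    z = (p1 - p2) / standard_error
    return math.erfc(abs(z) / math.sqrt(2))


def keep_item(pct_alcoholic, pct_normal, n_alcoholic=98, n_normal=193,
              alpha=0.01, min_gap=15.0):
    """Apply both criteria: p beyond alpha AND a gap of at least min_gap points."""
    p = two_proportion_p(pct_alcoholic / 100, n_alcoholic,
                         pct_normal / 100, n_normal)
    return p < alpha and abs(pct_alcoholic - pct_normal) >= min_gap


if __name__ == "__main__":
    # Hypothetical "plus response" percentages, for illustration only.
    print(keep_item(60, 30))   # large, reliable gap
    print(keep_item(50, 40))   # gap under 15 points
```

Requiring both conditions means that an item with a statistically reliable but small difference is dropped, which is the sense in which the criteria favor practical as well as statistical significance.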


Mean Profiles. Samples N-1, C-2, and C-3 were compared with the combined A-1 
and A-2 samples on the basis of mean scores obtained on each of the nine clinical 
scales of the MMPI. 


Pattern Analysis. The frequencies with which various MMPI patterns occurred in the 
N-2, C-4, and combined A-1 and A-2 samples were compared. 


RESULTS AND DISCUSSION 


Item Analysis. Using the criteria described above, a total of 68 items were obtained. 
These items are listed in Table 1, along with the scoring direction. Responses of the 


TABLE 1. ITEMS INCLUDED IN THE ALCOHOLIC SCALE, BOOKLET NUMBER, AND KEYED DIRECTION 

( ) indicates the scoring direction for the item. 

Booklet 
Number  Item 

26      I feel that it is certainly best to keep my mouth shut when I’m in trouble. 
39      At times I feel like smashing things. (F) 
46      My judgment is better than it ever was. (F) 
61      I have not lived the right kind of life. (T) 
        I do many things which I regret afterwards (I regret things more or more often than others seem to). 
        I go to church most every week. (F) 
        I have met problems so full of possibilities that I have been unable to make up my mind about them. (T) 
        My hardest battles are with myself. (T) 
        I know who is responsible for most of my troubles. (T) 
        I do not worry about catching diseases. (T) 
        I like to cook. (T) 
        I would like to be a soldier. (F) 
        At times I feel like picking a fist fight with someone. 
        I am neither gaining nor losing weight. (F) 
        I have used alcohol excessively. (T) 
        I think I would like the work of a contractor. (T) 
        It is not hard for me to ask help from my friends even though I cannot return the favor. 
        My relatives are nearly all in sympathy with me. (F) 
        I have been disappointed in love. (T) 
        I am entirely self-confident. (F) 
        I have very few fears compared to my friends. (F) 
        I am always disgusted with the law when a criminal is freed through the arguments of a good lawyer. (F) 
        I am not likely to speak to people until they speak to me. 
        I have never been in trouble with the law. (F) 
        There never was a time in my life when I liked to play with dolls. (F) 
        I worry over money and business. (F) 
        My father or mother often made me obey when I thought it was unreasonable. (F) 
        I feel anxiety about something or someone almost all the time. (F) 
        I usually have to stop and think before I act even in trifling matters. 
        I have a habit of counting things that are not important such as bulbs on electric signs, and so forth. (F) 
        I tend to be on my guard with people who are somewhat more friendly than I had expected. 
        I get anxious and upset when I have to make a short trip away from home. 
        Sometimes some unimportant thought will run through my mind and bother me for days. (F) 
        I am inclined to take things hard. (F) 
        I feel uneasy indoors. (F) 


Booklet 


Number 


366 
375 
378 
383 
386 


= 


387 


Item 


Even when I am with sneite I feel lonely much of 
the time. (F) 


When I am feeling very hs Appy and active someone 

who is blue or low will spoil it a F 

I do not like to see women smoke. (F) 

People often disappoint me. (F) 

I like to keep people guessing what I’m going to do 

next i 

The pot miracles I know are simply tricks that 

other people play on one another. (F) 

It makes me feel like a failure when I hear of the 

success of someone I know well. (F) 

If given the chance I would make a good leader of 

people. 

I have had some very unusual religious experiences. 
F) 


One or more members of my family is very nervous. 
F) 


I am embarrassed by dirty stories. (T) 

I have strong political opinions. (F) 

I used to have imaginary companions. (F) 

People usually demand more respect for their own 
rights than they are willing to allow for others. 

It is all right to get around the law if you don’t 
actually break it. (T) 

I enjoy gambling for small stakes. (T) 

I have one or more bad habits which are so strong 
that it is no use in fighting against them. (F) 

I have used alcohol moderately (or not at all). (F) 

I have several times had a change of heart about my 
life work. (" 

I am fascinated by fire. (F) 

Whenever possible I avoid being in a crowd. (F) 

If I were in trouble with several friends who were 
equally to blame, I would rather take the blame 
than to give them away. (T) 

Christ pe rforme d miracles such as changing water 

into wine 

It is unusual for me to express strong | aogueren or 
disapproval of the actions of others 

I have had periods when I felt so full “ pep that 
sleep did not seem necessary for days at a time. : (F) 

I think Lincoln was greater than Washington. (F) 

Some of my family have quick tempers. (F) 


I am not afraid of picking up a disease or germs 
from door knobs. (T) 


I am not bothered by c great deal of belching of gas 
from my stomach 

If I were an artist I aw like to draw children. (T) 
I sometimes feel that I am about to go to pieces. (F) 
A large number of people are guilty of bad sexual 
conduct. (F) 


I am gres atly bothered by forgetting where I put 
things. 
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A-2 group were scored on the scale thus obtained in a cross-validation effort. This 
sample averaged slightly higher (28.4 vs. 25.5) than the A-1 sample. Apparently the 
rigorous selection technique provided an effective safeguard against validity shrink- 
age. 

Two other questions were investigated. First, will the scale differentiate ade- 
quately between alcoholics and normals? Second, will the scale differentiate ade- 
quately between alcoholics and other clinical groups? To answer the first of these 
questions, the MMPI responses of the N-1 sample were scored on the new scale. 
The mean score of the normal group (20.0) was significantly different from that of 
both the A-1 and A-2 groups (t values were 9.21 and 14.12). By establishing a critical
score at 24, 76 percent of the alcoholics and 80 percent of the normals were correctly 
identified. The second question was answered only partially. MMPI responses of
the C-1 sample were scored on the new scale. The mean score obtained (26.4) was 
not significantly different from that of the alcoholic samples. 
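
The use of a critical score can be illustrated with a minimal sketch. The scores below are invented for illustration only, and the rule that a score at or above the cutoff is called "alcoholic" is an assumption, since the text does not say whether the boundary at 24 is inclusive.

```python
def classification_rates(alcoholic_scores, normal_scores, cutoff=24):
    """Percent of each group correctly identified by a critical score.

    Assumption: a score at or above the cutoff is called "alcoholic";
    the source does not state whether the boundary is inclusive.
    """
    hits_alcoholic = sum(score >= cutoff for score in alcoholic_scores)
    hits_normal = sum(score < cutoff for score in normal_scores)
    return (100.0 * hits_alcoholic / len(alcoholic_scores),
            100.0 * hits_normal / len(normal_scores))


# Hypothetical scale scores, invented purely for illustration.
alcoholics = [31, 28, 22, 26, 30, 19, 27, 25, 33, 24]
normals = [18, 21, 25, 17, 20, 23, 15, 26, 19, 22]

pct_alcoholic, pct_normal = classification_rates(alcoholics, normals)
print(f"alcoholics correctly identified: {pct_alcoholic:.0f}%")
print(f"normals correctly identified: {pct_normal:.0f}%")
```

Raising the cutoff in such a sketch trades correct identification of alcoholics for correct identification of normals, which is the balance a critical score is chosen to strike.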


Mean Profiles. For purposes of comparing scores obtained on the nine clinical keys 
of the MMPI, samples A-1 and A-2 were combined into a single sample of 177
alcoholics. This procedure was not entirely justified, since the A-2 group did score
significantly higher than the A-1 group on Pt, Sc, and Ma. The largest of the mean
raw score differences was less than 5, however, and this, plus the fact that the general 
shape of the mean profiles was very similar for both samples (Pd and D above 60 
but below 70, other scales above 50 but below 60) was used to justify combining the 
two samples in order to obtain a larger N. 

Figure 1 provides visual comparisons of the mean profile of the alcoholic samples
with that of normals (N-1), moderate psychoneurotics (C-2), and psychotics (C-3).§
Two general conclusions from these figures are obvious. (a) Normals and alcoholics
do not show markedly different mean profiles on the MMPI. The chief differentiating
scales (Pd and D) suggest at best only modest differences in mean scores. (b) On
the other hand, each of the two clinical samples obtained mean profiles widely
disparate from that obtained by the alcoholic group. In the case of the psychoneurotic
group (C-2), only Pd and Mf failed to differentiate, and marked differences on the
neurotic triad were obtained. The differentiation was even more general in the case
of psychotics (C-3), where Mf was the only scale on which similar mean scores were
obtained.

Fig. 1. Comparison of the Mean Scores of 177 Alcoholics with 24 Moderate Psychoneurotics,
22 Psychotics, and 50 Normals

§Reprinted with permission of Dr. Harrison Gough and the J. clin. Psychol.


Pattern Analysis. An obvious weakness in the preceding analysis is that the mean 
can conceal more than it reveals. To avoid this pitfall, profiles from the A-1 and


TABLE 2. FREQUENCY OF VARIOUS MMPI PATTERNS** FOR ALCOHOLICS, NORMALS, AND
PSYCHIATRIC PATIENTS®





                  Alcoholics (N = 177)     Normals (N = 258)       Psychiatric (N = 710)
Pattern           T Scores                 T Scores                T Scores
Codes             Above 54   Above 70      Above 54   Above 70     Above 54   Above 70
.60 0.00 23 .60* 0. 80 0.00
00 0.00 1.30* 0. .00 0.00 
.00 0.00 3 .50* 2.3 10* 5.80* 
30 0.60 0.40 0. .40* .30 
.60 0.00 0.40 0. 60 ).60* 
.60 0.00 1.60 0. .30 se 
.00 0.00 0.80 0.06 .80* .60* 


30 2.30 0.80 ‘ ) .60* .20* 
80 10 1.20 : 3.80 .20* 
.60 2.80 0 .40* . | 2.40* 
.60 ‘ .00 0.80 10 
40 2.30 0.80 ‘ .90* 


.70* 
.10* 
40* 
.50* 
.40* 
60 
.00* 


.00 2. 40*
60 .80* 
“50 3.50 
20 .00* 
80 40 
00 90 
10 .90* 



4 

4-2 
4-3 
4-6 
4-7 
4-8 


00 .00 .20* | 00

10 | 40 , .10* 
.00 00 | 2.30° | .70* 
.00 00 | 40 ; .00* 




.00 00 | .60* .00 
.00 .00 .20* .60* 
.70 60 20 3.80 
60 60 .20 8 | .40* 
.00 00 | .20* | 30 
.00 .00 1.20* 0. .10*
.00 00 0.80 0 .70* 
00 .00 1.20* 0.00 | .80* 
.00 00 0.80 0.00 .10* 
60 .00 0.40 0.00 3.20* 


80 .00 9.40* 0.80 1.10
00 .00 0.00 0.00 0 .60* 
00 00 0.40 0.00 1.00* 
00 .00 0.80 0.00 0.80* 
.00 .00 | Loo 0.40 0.10 
.60 60 0.80 0.40 2.10* 





*Significantly different from the alcoholic percentage at the .05 level or lower. 
**Patterns where neither the normal nor psychiatric samples differed significantly from the alco- 
holic have been omitted. 


‘Reprinted with permission of Dr. Starke Hathaway and the University of Minnesota Press. 
A-2 samples were coded according to Hathaway’s system“, and the frequency of
such codes was compared with Hathaway’s data for normal (N-2) and psychiatric
(C-4) samples. In accordance with Hathaway’s system, special note was taken of T 
scores above 70. These comparisons are seen in Table 2. Several generalizations are 
apparent from this table. 


1. The alcoholic and the psychiatric groups differed significantly in the 
patterns exhibited. Using the five percent level of confidence, we would expect 
3.25 significant differences by chance; actually, 26 such differences were found. 

2. The alcoholics and the normal group differed significantly in the pat- 
terns exhibited. Using the five percent level of confidence, we would expect by 
chance to announce 3.25 significant differences; actually, 18 such differences 
were found. 

3. The percentage of alcoholics exhibiting scores > 70 was significantly lower
than that of the psychiatric sample.

4. The percentage of alcoholics exhibiting scores of 70 or more showed no 
difference from the normal sample. Conclusions 3 and 4 confirmed the earlier 
findings in terms of the diagnostic significance of profile elevation. Markedly 
elevated profiles tended to belong to the psychiatric samples much more fre- 
quently than to either the alcoholic or normal samples. 

5. The Pd scale (scale number 4) was consistently the highest peaked scale 
for the alcoholic. 
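
The chance expectation cited in generalizations 1 and 2 is simply the significance level multiplied by the number of comparisons. The sketch below assumes 65 pattern comparisons, a figure inferred from 3.25 / .05 rather than stated in the text, and it treats the comparisons as independent, which is a simplifying assumption.

```python
from math import comb


def expected_chance_hits(n_comparisons, alpha=0.05):
    """Number of 'significant' results expected by chance alone."""
    return n_comparisons * alpha


def prob_at_least(k, n, p):
    """Binomial upper tail P(X >= k), assuming independent comparisons."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


# Assumption: 65 pattern comparisons, inferred from 3.25 / 0.05.
n_patterns = 65
print(expected_chance_hits(n_patterns))
# Chance probability of 26 or more, and of 18 or more, significant differences.
print(prob_at_least(26, n_patterns, 0.05))
print(prob_at_least(18, n_patterns, 0.05))
```

Under these assumptions both observed counts lie far in the upper tail of the chance distribution, which is the point the generalizations make informally.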


The heterogeneity of the psychiatric group in terms of specific diagnoses makes 
the comparisons in Table 2 less meaningful. It seems reasonable to expect that,
in the future, pattern listings for various syndromes will be available, making
possible much better comparative information.


SUMMARY AND CONCLUSIONS 


The major purpose of this study was to identify, by use of the MMPI, personality
characteristics of alcoholics which differentiate them from normal non-alcoholics
and from other clinical groups. Three methods of accomplishing this purpose were
employed. First, an item analysis of the MMPI responses of a sample of alcoholics
and a sample of normals was performed; the resulting scale was subjected to further
analysis. Second, mean scores of alcoholics on the nine clinical scales of the MMPI
were compared with mean scores of normals, neurotics, and psychotics. Finally, the
frequencies of various MMPI patterns for groups of alcoholics, normals, and psychiatric
cases were compared. Within the limits of the samples available, it is concluded
that:

1. The special scale developed in this study correctly identified 75 to 80 percent
of alcoholics and normals under cross-validation conditions. 

2. This scale did not discriminate between alcoholics and a group of psycho- 
neurotics. 

3. Mean MMPI profiles for alcoholics were, in general, quite similar to those for
normals, although Pd and D were somewhat elevated for the alcoholic sample.

4. Samples of psychoneurotics and psychotics obtained much higher mean 
scores on the MMPI scales than did alcoholics. 

5. These four findings suggest that the three groups (normal, alcoholic, and
“clinical”) can be differentiated by a combination of the MMPI scales and the
special “alcoholic” scale.

6. Certain MMPI patterns occurred more frequently in the alcoholic sample
than in normal and “psychiatric” samples, particularly in the Pd scale.

7. Lack of clearly defined samples requires caution in interpreting these findings 
and argues for additional research with better defined groups. 
THE EFFECT ON SELF DESCRIPTION OF COMBINING ANXIETY AND
HOSTILITY ITEMS ON A SINGLE SCALE¹


MARVIN J. FELDMAN AND SAUL M. SIEGEL 
University of Buffalo Topeka State Hospital 


PROBLEM 


In the extensive use of personality questionnaires and ratings, little attention has
been paid to the possibility that the combination of items being rated may influence 
ratings. Rather it is generally assumed that ratings depend mainly on the person- 
ality of the subject being rated and notions about personality held by the rater. It 
seems legitimate to inquire, however, whether each item in a test is responded to 
independently or is to some extent influenced by the nature of the preceding items. 
The latter view would suggest that the order and composition of items on a scale have 
some bearing on the personality portrait which emerges from ratings. 

Asch“? has conducted experiments which throw some light on the problem. He 
found that systematic changes in responding to the same attribute occurred when 
simple switches were made in the order in which the attribute was presented and 
when the total composition of the items was altered. For example, ‘‘cold’’ has a quite 
different meaning for raters when embedded with: intelligent - skillful - industrious - 
cold - determined - practical - cautious than in the series: intelligent - skillful - sincere 
- cold - conscientious - helpful - modest. In Asch’s study the judges were not rating a 
real person but rather were forming impressions of a hypothetical case from a list of 
attributes. With such a task, the interaction of items might exert the maximum in- 
fluence on rater response. A rating of a real person or a self rating might limit the 
interaction effect of items as an influence on ratings. 

The present study was designed to test the concept that the total composition of 
a test affects responses to the component parts of the test. The limiting case of self 
ratings was used. Presumably the rater’s self evaluations would be least influenced 
by side comparisons among items. Hence, if item composition influences self ratings, 
ratings of others can be expected to be similarly influenced. 


¹This article is based upon a paper delivered at the Eastern Psychological Association Meeting,
April, 1957. The writers wish to acknowledge the help of Miss Sherry Carpenter and Mr. Richard 
Remy in computing the statistics. 
MATERIALS


To test the effects of item composition on response, three related personality
inventories were used: (a) The Taylor Manifest Anxiety Scale (MAS) - 50 items per- 
taining to anxiety plus 175 buffer items’; (b) The Siegel Manifest Hostility Scale 
(MHS) - 50 items pertaining to hostility plus the same 175 buffer items“, and (c) 
Combined form - 50 MAS, 50 MHS, plus 125 of the same buffer items. 

Siegel“) devised the MHS following the same procedure utilized in the con- 
struction of the MAS, that is judges agreed that certain MMPI items reflected mani- 
fest hostility. In all three forms the scale items were distributed randomly among the 
buffer items. Anxiety and hostility were chosen as the variables to test our hypothe- 
sis because they are clinically relevant variables which might reasonably be expected 
to exert some mutual influence. Nevertheless, we could not state in advance what 
direction we expected the effect to take. 
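
The composition of the three forms can be sketched as follows. The item labels are hypothetical placeholders, the choice of which 125 buffer items the combined form retains is an assumption (the text says only that 125 of the same buffers were used), and the seeds are arbitrary.

```python
import random


def build_form(scale_items, buffer_items, seed):
    """Return one form with the scale items scattered at random among buffers."""
    form = list(scale_items) + list(buffer_items)
    random.Random(seed).shuffle(form)
    return form


# Hypothetical item labels; the actual item wording is not reproduced here.
mas_items = [f"MAS-{i}" for i in range(1, 51)]
mhs_items = [f"MHS-{i}" for i in range(1, 51)]
buffers = [f"BUF-{i}" for i in range(1, 176)]

mas_form = build_form(mas_items, buffers, seed=1)
mhs_form = build_form(mhs_items, buffers, seed=2)
# Assumption: the combined form keeps the first 125 buffers.
combined_form = build_form(mas_items + mhs_items, buffers[:125], seed=3)

print(len(mas_form), len(mhs_form), len(combined_form))
```

Each form comes to 225 items, so the three forms are equal in length and differ only in how many scale items replace buffers.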

SUBJECTS 

The subjects were mainly freshmen and sophomores enrolled in an introductory 
psychology class. The tests were administered as part of the regular class work. Men 
outnumbered women, 67 to 29, but the respondents were distributed fairly evenly in 
the subgroups to control any possible effect due to the sex of the subject. The aver- 
age age of the subjects was 19.8 years with little variability. Some subjects were ex- 
cluded on a random basis to equalize N’s in the subgroups. 

PROCEDURE 

The tests were administered in two sessions, one week apart. Two separate 
classes were used for replication purposes. Each class was told that the examiner was 
collecting normative data on a new personality test. The subjects thought they were 
all taking the same test. Actually in the first testing, half of each class took the com- 
bined form, containing both MAS and MHS items, a quarter of each class took the 
form with MAS items, and the remaining quarter took the form with MHS items. A 
week later subjects were tested again ostensibly so that the reliability of the test could 
be assessed. On the second administration, subjects who previously took the com- 
bined form now received one of the other forms containing either the MAS or MHS 
items. Conversely, subjects who took either the MAS or MHS initially now received 
the combined form. Thus each subject took two of the three forms. All 96 subjects 
took the combined form, 56 took the separate MHS form and 40 the separate MAS 
form. None of the subjects suspected that the second test differed somewhat from 
the first test. 

RESULTS 


Before presenting the main results, it is of interest to examine the reliability of 
the measures. It is possible that the reliability of the measures differs in the com- 
bined compared with the separate forms of the scales. It is also possible that re- 
liability might be affected by the order in which the forms were taken. With the 
present design, both test-retest and split-half reliability coefficients could be
computed.

Table 1 indicates that the reliability coefficients were fairly high. Test-retest 
coefficients were somewhat higher than split-half coefficients even when the latter 


TABLE 1. RELIABILITY COEFFICIENTS ON MAS AND MHS


Form                                     MAS    MHS

Test-retest:
  separate vs. combined form             .86    .78
  separate first vs. combined second     .85    .84
  combined first vs. separate second     .89    .76
Split-half:
  separate form                          .64    .66
  combined form                          .62


*corrected by Spearman-Brown formula
were corrected by the Spearman-Brown formula. Frequently with personality tests
the reverse holds true. Neither the form of the scale, nor the order in which tests 
were taken seems to have affected the reliability. The present test-retest coefficients 
compare favorably with other results reported ©. The present split-half correlation 
on the MAS is somewhat lower than results reported by Hilgard, Jones, and Kap- 
lan®?, but on the MHS, the coefficients are about the same as was found in the 
initial study“. In general, however, the reliability coefficients are high enough to 
permit confidence in using these measures in the present study. 
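
For readers unfamiliar with the Spearman-Brown correction mentioned above, a minimal sketch follows. The half-test correlation of .50 is a hypothetical value chosen for illustration and is not taken from Table 1.

```python
def spearman_brown(r_half, length_factor=2.0):
    """Predicted reliability when a test is lengthened by length_factor.

    With length_factor = 2 this is the usual split-half correction:
    r_full = 2 * r_half / (1 + r_half).
    """
    return length_factor * r_half / (1.0 + (length_factor - 1.0) * r_half)


# Hypothetical half-test correlation, for illustration only.
r_half = 0.50
print(f"half-test r = {r_half:.2f} -> corrected r = {spearman_brown(r_half):.3f}")
```

The correction always raises a positive half-test correlation, which is why a corrected split-half coefficient that still falls below the test-retest value is worth remarking on.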

To test the main hypothesis, several analyses of variance were computed using a
Type III mixed design described by Lindquist “. This design conveniently tests the
main assumptions of the study while also permitting an analysis of two control factors
inherent in the study, namely possible differences between classes (a replication
factor) and possible differences stemming from the order in which the two forms of
the test were taken, that is, the separate MAS or MHS first or the combined
form first.

Table 2 presents the results from the analysis of the MAS scores. None of the F 
ratios were significant. Thus it seems that MAS scores were not affected by in- 
clusion of hostility items. The two classes gave essentially the same results, and the 
order in which forms were taken had no effect. 

A similar analysis was carried out to determine the influence of anxiety items on 
MHS scores. Table 2 indicates that MHS scores did not vary in terms of the control


TABLE 2. ANALYSIS OF VARIANCE OF MAS AND MHS SCORES


Source                   df (MAS Scores)

Between Ss               39
  B - order              1
  C - class              1
  BC
  error (b)              36
Within Ss                40
  A - form of test
  AB
  AC
  ABC
  error (w)
Total


*significant > .05 < .10 level
**significant at .05 level


variables, different classes or order in taking the two forms of the test. The main 
hypothesis is borne out to some extent, however, for MHS scores were reduced somewhat
by inclusion of anxiety items. Thus the analysis of variance comparing the
separate with the combined form of the MHS yields an F ratio of 3.61 which is signi- 
ficant between the 5 and 10% level. Furthermore, the effect is more pronounced 
when subjects gain familiarity with the scale by taking the separate MHS form first, 
for there is a significant interaction at the .05 level between form of the scale and 
order in taking the two forms. To facilitate comparisons with other studies using 
these scales, Table 3 presents the means and standard deviations of the various
subgroups.

The results obtained suggest that combining subscales had only a minimal effect
on responses to either scale. It seemed possible that the effect might be stronger in 
the second half of the scale after subjects built up a set on the first half. To test this 
possibility the scales were all rescored on the last 25 items only. The same statistical 
operations were then carried out. The results, however, were in substantial agree- 
ment with the main findings so do not seem to warrant a detailed analysis. Again 
TABLE 3. SUBGROUP MEANS AND STANDARD DEVIATIONS ON THE MAS AND THE MHS


                                      MAS (N = 10 per group)    MHS (N = 14 per group)
Group                                 M        SD               M        SD





Separate form first — class A 13. 8.: 5.7 
Combined form second — class A 13.3 73 3.4 


Combined form first — class A 2. . aa 
Separate form second — class A 5.2 ‘ 


Separate form first — class B 5. 
Combined form second — class B F 3.6 


Combined form first — class B : 9.6 11.8 
Separate form second — class B 5. ; 12.6 





the inclusion of hostility items seemed to have little effect on responses to MAS 
while inclusion of anxiety items lowered scores on the MHS, but the F ratio was still 
only significant at the .05 to .10 level. 


SUMMARY 


This study investigated the possibility that responses to test items might be in- 
fluenced by inclusion of different item content in the same test. It was found that 
scores on a scale of manifest hostility were lowered when the total test also included 
anxiety items. The effect was not general, for scores on a scale of manifest anxiety
remained constant when hostility items were added to the total test.
FACE VALIDITY IN THE D SCALE OF THE MMPI
RAYMOND J. McCALL¹


Marquette University 


PROBLEM 

In an earlier study Doleys“? found the D scale of the MMPI to be of moderate 
validity (p < .05) in discriminating a group of 41 depressives from a matched group 
of non-depressive psychotics. An item analysis of his results showed, however, that 
sizable portions of the D scale were of little or no differential value while other blocs 
of items seemed quite effective. Aside from noting the inutility of the so-called 
“correction’’ items®), Doleys was unable to group the discriminating and the non- 
discriminating items into very meaningful classes. 


1Thanks are due to E. J. Doleys Jr., who compiled the original data, and to C. L. Klingberg and 
P. J. Vitu for assistance with statistical computations. 
METHOD


Independently of Doleys’ analysis, the writer decided to test on these data the 
hypothesis that the most discriminating items would be those which come closest 
to describing the known symptomatology of depression; that items less essential to 
the syndrome would be less discriminating; and that, finally, items inconsistent with 
or irrelevant to the depressive syndrome would contribute least to distinguishing 
diagnosed depressives from non-depressive psychotics. Accordingly, the 60 items of 
the D scale were broken down a priori into 3 groups: 26 “face valid” (FV) items,
22 “congruent” (CG) items, and 12 “irrelevant” (IR) items. These items as scored
for depression may be identified as follows: 


I. “FACE VALID” ITEMS


B2 Not understanding what I read as well as formerly 
Not enjoying different kinds of play and recreation 
Wishing I could be as happy as others 
Not feeling that life is worthwhile 
Finding daily life not interesting 
Having difficulty in starting to do things 
Having periods when I could not get going 
Brooding a great deal 
Crying easily 
Not caring what happens to me 
Not being happy most of the time 
Finding it hard to keep my mind on a task 
Feeling useless at times 
Not being in as good physical health as friends 
Having felt better ef sd 
Worrying about health 
Not having a good appetite 
Troubled by constipation 
Not going to sleep without troublesome thoughts 
Not being as capable and smart as most others 
Judgment not better than ever 
Lacking self-confidence 
Not having periods of unusual cheerfulness 
Not feeling full of energy 
Not feeling excitedly happy even when things go wrong 
Not being as able to work as ever 


II. “CONGRUENT” ITEMS


Worrying about locked doors and windows when leave home 
Not teasing animals 

Not dreaming frequently about things best kept to myself 
Blaming a person for taking advantage of someone 

Being terribly hurt by criticism 

Worrying about catching diseases 

Never laughing at a dirty joke 

Not having been well most of the last few years 

Either gaining or losing weight 

Often feeling weak all over 

Sleep fitful and disturbed 

Being easily awakened by noise 

Never feeling like smashing things 

Never feeling like picking a fist fight 

Never feeling like swearing 

Believing I am more nervous than most 

Memory not seeming all right 

Being afraid of losing my mind 

Not liking to flirt 

Passing by other than close friends unless they speak first 
Not being a good mixer 

Working under a great deal of tension 


III. “IRRELEVANT” ITEMS 


Having had a fit or convulsion 
Not sweating easily on cool days 
Not sweating when embarrassed 
*B6 Not having spells of hay fever or asthma

*BS Never having urinated or coughed up blood 

*D10 Not going to church regularly 

*D18 Not believing in the second coming of Christ 

*D22 Not believing things are turning out as predicted by the Biblical prophets 
*G37 Not keeping on at things till people lose their patience 

C33 Not agreeing that most people are hard to convince of truth 

A15 Being troubled by attacks of nausea and vomiting

C31 Not standing in other people’s way because of principle 


* “Correction’’ item 


This subclassification of the D scale items may be regarded as purely “intuitive’’, 
since no formal attempt at empirical validation was made. The fact that 9 of 12 
items classified as “irrelevant” are from the 11 “‘correction’’ items of Hathaway and 
McKinley, or that most of the FV items may be found attributed to depressives in 
the standard text books, reveals something about the author’s procedure but is in no 
way crucial to the empiriological design. 


RESULTS 
Considering mean number of items answered in the scored direction by the de- 
pressives and the non-depressive psychotics, t-ratios set forth in Table 1 show that 
the 26 FV items differentiated the two groups at least as well as did the scale as a 
whole. 


TABLE 1. MEAN DIFFERENCES IN NUMBER OF SCORED D SCALE 
ITEMS FOR DEPRESSIVES AND NON-DEPRESSIVES 


Item Classification Mean diff 

Total D Scale 2.27 
(60 items) 

FV items 3.11 
(26 items) 

CG items 2.18 
(22 items)

FV + CG items 
(48 items) 

IR items 
(12 items) 


*p 








To test further the apparent trend toward lesser discriminating value as the 
items departed from face validity (which the t-test suggested but did not prove), 
the differences were analyzed non-parametrically according to the Wilcoxon signed- 
rank technique. The results are summarized in Table 2. 


TABLE 2. SIGNED-RANK DISCRIMINATION OF DEPRESSIVES FROM
NON-DEPRESSIVES IN THE MMPI D SCALE


Item Classification    Value of normal deviate    p

Total Scale            -3.64
FV items               -3.50
CG items               -2.16
IR items               - .81
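
A normal deviate for the signed-rank statistic can be obtained along the following lines. This is a sketch of the standard large-sample approximation, not necessarily the exact computation used in the study; the paired differences are hypothetical, and no tie or continuity correction is applied.

```python
from math import sqrt


def signed_rank_z(differences):
    """Normal deviate for the Wilcoxon signed-rank statistic.

    Zero differences are dropped and tied absolute values share the
    average rank. No tie or continuity correction is applied.
    """
    nonzero = [d for d in differences if d != 0]
    n = len(nonzero)
    ordered = sorted(nonzero, key=abs)
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        while j + 1 < n and abs(ordered[j + 1]) == abs(ordered[i]):
            j += 1
        average_rank = (i + j) / 2.0 + 1.0  # positions are 0-based
        for k in range(i, j + 1):
            ranks[k] = average_rank
        i = j + 1
    w_plus = sum(rank for d, rank in zip(ordered, ranks) if d > 0)
    mean = n * (n + 1) / 4.0
    sd = sqrt(n * (n + 1) * (2 * n + 1) / 24.0)
    return (w_plus - mean) / sd


# Hypothetical paired differences, invented for illustration.
diffs = [-0.20, -0.15, 0.05, -0.30, -0.10, 0.02, -0.25, -0.12, -0.08, -0.18]
print(round(signed_rank_z(diffs), 2))
```

A large negative deviate indicates that the differences run predominantly in one direction, which is how the values in Table 2 are read.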








The proportion of depressives and non-depressives answering each type of item 
in the scored direction was also analyzed by the analysis of variance technique 
(Table 3), each proportion having been first transformed into an arc sine equivalent.
As a further check, the analysis was repeated (Table 4), this time considering pro- 
portion of items of each type answered in the scored direction (as opposed to pro- 
portion of persons answering each type of item.). The results cohere perfectly, and 
are consistent with the trend indicated by the signed-rank test. 
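
The arc sine transformation applied before the analysis of variance can be sketched as below. The form 2·arcsin(√p) in radians is an assumption; the text does not say which variant or unit was used, and some tables of the period give the angle in degrees.

```python
from math import asin, sqrt


def arcsine_transform(p):
    """Variance-stabilizing transform for a proportion: 2 * asin(sqrt(p)), in radians."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("proportion must lie between 0 and 1")
    return 2.0 * asin(sqrt(p))


# Illustrative proportions only; these are not values from the study.
for p in (0.10, 0.50, 0.90):
    print(f"p = {p:.2f} -> phi = {arcsine_transform(p):.3f}")
```

The transform stretches proportions near 0 and 1, so that differences between extreme proportions are not understated relative to differences near .50.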
TABLE 3. ANALYSIS OF VARIANCE FOR PROPORTION OF DEPRESSIVES
AND NON-DEPRESSIVES SCORING POSITIVELY ON THE MMPI D SCALE


Source Sum of Squares df Mean Sq. 





Item Type 
Diagnostic category 
Interaction 

Within 


8 .02** 
4.74* 





Item Types               .81     2     .41     13.00**
Diagnostic categories    .17     1     .17      5.49*
Interaction              .09     2     .045
Within                   7.51    24
Total                    8.58    24





**p < .001
*p < .05 


SUMMARY AND CONCLUSIONS 


The 60 items in the depression scale of the MMPI were shown to be differentially
effective in distinguishing depressives from a matched group of non-depressive
psychotics in proportion to their “face validity”, as determined a priori.
The implication that the scale might be improved for purposes of differential
diagnosis by elimination of “irrelevant” items is probably worth further investigation.
THE TAYLOR MANIFEST ANXIETY SCALE IN
DIFFERENTIAL DIAGNOSIS¹


HAROLD RUBIN AND ALAN H. TOWNSEND 


Veterans Administration Hospital University of Michigan 
Lebanon, Pennsylvania 


PROBLEM 


The Taylor Manifest Anxiety Scale (MAS) was originally devised as a measure 
for exploring the relationship between performance in learning situations and effec- 
tive drive level“. More recently, however, the MAS has become rather widely 
used as a measure of clinical anxiety, and numerous studies evaluating its validity 
have been reviewed by Taylor“. In general they seem to suggest that the MAS 
bears some relationship to clinically observed anxiety. 

In one of the few studies on hospitalized psychiatric patients, Buss, et al. @? re- 
ported a correlation of .60 between the MAS and over-all ratings of anxiety obtained 
from a pool of judges. Taylor states“? “. . . the experimental results obtained with 
the anxiety scale might also hold for groups selected according to clinical criteria.”’ 
In a recent study by Siegman“?, the MAS scores of various psychopathological 
groups indicated that patients with a diagnosis of anxiety reaction had significantly 
higher MAS scores than patients in any of the other groups, and also that patients 
with a diagnosis of psychopathic personality had significantly lower scores than any 
other group. Rubin, et al. in a study on hospitalized psychotics found that patients 
who were rated as well-adjusted on the Hospital Adjustment Scale“? were relatively 
free of expressed manifest anxiety symptoms, obtaining low MAS scores. On the 
other hand, poorly adjusted patients were found to express both little and much 
manifest anxiety, and their MAS scores were unpredictable, ranging from very low 
to very high. 

It would seem to follow then that patients with psychiatric diagnoses of ‘‘anxiety 
reaction’’ should obtain higher MAS scores than patients with other psychoneurotic 
diagnoses but in whom manifest anxiety was not the predominant symptom“. It 
would also appear that better adjusted patients obtain lower MAS scores than the 
more poorly adjusted ones. The present study attempts to explore these hypotheses 
in evaluating the use of the MAS in the differential diagnoses of hospitalized pa- 
tients. 


SUBJECTS AND PROCEDURES 


Subjects were 239 hospitalized, male veterans on the admission ward of the 
neuropsychiatric service. Eighty-two subjects were diagnosed as psychoneurotics 
and 157 subjects as schizophrenics. Of the psychoneurotics, 57 were diagnosed as 
“anxiety reaction” and 25 carried diagnoses other than “anxiety reaction’). The 
schizophrenic group consisted of 82 patients who were scheduled to receive electro- 
shock therapy and 75 patients who were admitted at a later date, when shock therapy 
was no longer a prevalent form of treatment on the neuropsychiatric service. 

The shock group was included here and separately, because it was felt that their 
findings would contribute meaningfully to this study. It seemed that the decision to 
administer electroshock treatment was based in part on the clinical symptoms pre- 
sented at the time of admission and these in turn seemed to reflect the type of item 
sampled by the MAS. All of the diagnoses were made by the psychiatric staff of the 
hospital following a period of observation on the admission service. For the purpose 
of simplification, the two groups of neurotics will hereafter be referred to as the 
anxiety and non-anxiety groups; the two groups of schizophrenics will be referred to 
as the random schizophrenic and shock groups. 


1From the VA Hospital, Lebanon, Pennsylvania. 
RESULTS 


TABLE 1. MEAN TAYLOR MAS SCORES 

Columns: Mean, S.D. 
Rows: Psychoneurotic (Anxiety, Non-anxiety); Schizophrenic (Random, Shock) 
Legible entries, in order of appearance: 10.78, 9.82, 12.68, 15.59, 9.27, 9.17 
(the remaining cells are illegible; the group means are reported in the text) 


In Tables 1 and 2 are presented the mean MAS scores and t-values for the different 
psychopathological groups. Among the psychoneurotics, those patients diagnosed 
as “anxiety reaction” failed to show a significantly higher MAS score than 
those with a non-anxiety diagnosis, with the former group actually slightly lower. 
The MAS mean scores were 21.9 for the anxiety group and 23.2 for the non-anxiety 
group. Among the psychotics, there was a significant difference between the MAS 
scores of those patients selected at random and the MAS scores of patients selected 
for electroshock therapy, the latter group scoring higher. The random schizophrenic 
group obtained a mean MAS score of 14.9 and the shock group a mean MAS score 
of 20.3. When the total psychoneurotic group was compared with the total psychotic 
group, there was a significant difference between their mean MAS scores, with the 
former receiving the higher scores. For the psychoneurotics the MAS mean was 22.4 
and for the psychotics it was 17.7. 


TABLE 2. t VALUES OF THE DIFFERENCES BETWEEN MEAN 
TAYLOR MAS SCORES FOR DIFFERENT PSYCHOPATHOLOGICAL 
GROUPS 

Psychopathological groups (the t values are illegible) 

Psychoneurotic vs. schizophrenic 
Anxiety vs. non-anxiety 
Shock vs. random 
Anxiety vs. random 
Non-anxiety vs. random 
Anxiety vs. shock 
Non-anxiety vs. shock 



Further inter-comparisons of the four groups indicate that significant differ- 
ences occur only when the random schizophrenic group is included. By themselves, 
the three other groups (anxiety, non-anxiety and shock) fail to obtain significantly 
different MAS scores, as can be observed from Table 1. 


DISCUSSION 


That the MAS as a measure of “anxiety” is inconsistent with psychiatric diag- 
noses is not surprising, in view of the different frames of reference on which they are 
based. The diagnosis of “anxiety reaction” usually follows the APA diagnostic 
manual (1), which claims this condition to be “. . . characterized by anxious expecta- 
tion and frequently associated with somatic symptomatology.” On the other hand, 
the MAS measures symptoms of a more behavioral nature (restlessness, irritability, 
etc.) and it is dependent on “the subject’s ability and willingness to express such 
symptoms or complaints.” 

In a recent study on anxiety in clinical situations, Buss, et al. (2) reported a much 
lower correlation between the MAS and observed (by rater) categories of manifest 
anxiety than between the MAS and reported (by subject) categories of manifest 
anxiety. It was also reported that the highest relationships were found between the 
MAS and physiological concomitants of manifest anxiety as reported by the subject. 
Siegman (6) attributes a low correlation between the MAS and a manifest anxiety 
rating scale to the fact that the MAS is “loaded with chronic somatic manifestations 
of anxiety (headaches, stomach troubles, etc.)” which in an item analysis failed to 
differentiate between his psychopathological groups and which he concludes is thus 
“an undesired source of variance”. 


SUMMARY 

The present study attempted to evaluate the role of the Taylor Manifest 
Anxiety Scale in differential psychiatric diagnosis. Subjects were 239 hospitalized 
psychiatric patients of whom 82 were diagnosed as psychoneurotic and 157 as schizo- 
phrenic. Results seemed to indicate that although the MAS made some contribution 
to the psychiatric evaluation of these patients, it was limited in its ability to discrim- 
inate between patients carrying different diagnostic “labels”, including anxiety 
reaction. 


REFERENCES 

1. American Psychiatric Association. Diagnostic and statistical manual. Washington, D. C.: 
American Psychiatric Association, 1952. 

2. Buss, A. H., Wiener, M., Durkee, A., and Baer, M. The measurement of anxiety in clinical 
situations. J. consult. Psychol., 1955, 19, 125-129. 

3. Kelly, G. A. The psychology of personal constructs. Vol. 1. A theory of personality. New York: 
Norton, 1955. 

4. McReynolds, P. and Ferguson, J. T. Clinical manual for the Hospital Adjustment Scale. Stan- 
ford, Calif.: Stanford U. Press, 1953. 

5. Rubin, H., Schnederman, L., Hallow, W. C., and Jones, R. J. Manifest anxiety in psychotics. 
J. clin. Psychol., 1956, 12, 94-96. 

6. Siegman, A. W. Cognitive, affective and psychopathological correlates of the Taylor Manifest 
Anxiety Scale. J. consult. Psychol., 1956, 20, 137-141. 

7. Taylor, J. A. Drive theory and manifest anxiety. Psychol. Bull., 1956, 53, 303-320. 


PREDICTION OF REHOSPITALIZATION USING THE MMPI¹ 


PETER F. BRIGGS 


University of Minnesota 


Peterson has offered a six-point scale which bears on the diagnosis of sub- 
clinical schizophrenia and the prediction of future hospitalization derived from 
MMPI profiles. The present report describes a study of the power of this scale to 
predict psychiatric rehospitalization. 


PROCEDURE 


Five diagnostic samples were drawn from the inpatient records of the psycho- 
pathic hospital at the University of Minnesota. Four of these samples have been 
presented by Schofield and Briggs in a study of methods of clinical prediction. The 
diagnostic groups are manic-depressive (17), involutional depression (17), schizo- 
phrenia (30), psychoneurosis (35), and mixed psychoneurosis (42). The requirement 
for selection in the first four groups was the presence of both intake and discharge 
Multiphasic profiles. The fifth group has been described by Holt. It included all 
patients having a diagnosis of mixed psychoneurosis admitted to the hospital be- 
tween 1942 and 1948 and having intake MMPIs. 


1Partly supported by a grant-in-aid of research from the University of Minnesota Graduate 
School. 
Patients in all five samples were cleared through the state’s Central Index which 
records all cases of hospitalization in state institutions. For the first four groups the 
median follow-up period was five years and ten months. At that time, 30% of these 
samples had been rehospitalized. Among the mixed psychoneurotics, 31 per cent 
had been rehospitalized within a median follow-up period of eleven years. 

All admission MMPI profiles were scored on the six points suggested by Peter- 
son: (a) T scores of four or more of the clinical scales over 70, (b) F > 65, (c) Sc > 
Pt, (d) Pa or Ma > 70, (e) Pa or Sc or Ma > Hs and D and Hy, (f) D > Hs and Hy. 
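
The six points listed above can be tallied mechanically from a profile of T scores. The following is a minimal sketch only, not Peterson's or the author's own procedure. It rests on several assumptions: that "the clinical scales" means the nine scales named in `CLINICAL_SCALES`, that point (e) is met when any one of Pa, Sc, or Ma exceeds all three of Hs, D, and Hy, and that point (f) is met when D exceeds both Hs and Hy. The function name and the example profile are hypothetical.

```python
# Assumed set of clinical scales; the text does not enumerate them.
CLINICAL_SCALES = ("Hs", "D", "Hy", "Pd", "Mf", "Pa", "Pt", "Sc", "Ma")

def peterson_points(t):
    """Count how many of the six points a profile of T scores meets."""
    neurotic_peak = max(t["Hs"], t["D"], t["Hy"])
    signs = [
        sum(t[s] > 70 for s in CLINICAL_SCALES) >= 4,            # (a)
        t["F"] > 65,                                             # (b)
        t["Sc"] > t["Pt"],                                       # (c)
        t["Pa"] > 70 or t["Ma"] > 70,                            # (d)
        any(t[s] > neurotic_peak for s in ("Pa", "Sc", "Ma")),   # (e), assumed reading
        t["D"] > max(t["Hs"], t["Hy"]),                          # (f), assumed reading
    ]
    return sum(signs)

# Hypothetical admission profile, for illustration only.
example_profile = {"Hs": 55, "D": 75, "Hy": 60, "Pd": 72, "Mf": 50,
                   "Pa": 74, "Pt": 68, "Sc": 80, "Ma": 58, "F": 70}
example_score = peterson_points(example_profile)
```

A score obtained this way could then be dichotomized between three and four points, as is done in the results that follow.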


RESULTS 


Inspection of the results suggests that rehospitalization becomes more likely as 
the number of Peterson scale points increases. For the groups combined, this trend 
is not reliable because the schizophrenia group appears to respond with a reverse 
trend at the highest score. Table 1 shows the number of cases rehospitalized in each 
of the five groups for scale scores dichotomized between score 3 and 4. The table 
shows clearly that the likelihood of rehospitalization among neurotics is greater for 
those with more than three scale points. Taking the same point of dichotomy for the 
Peterson scale, the combined neurotic groups have a rate of 19 per cent rehospitalized 
for those with scores below 4 as opposed to a rate of 52 per cent rehospitalized for 
those with scale scores of 4 or more (χ² significant at P < .01). 


TABLE 1. NUMBER OF CASES REHOSPITALIZED DICHOTOMIZED BETWEEN THREE AND FOUR ON 
PETERSON’S SCALE FOR FIVE DIAGNOSTIC GROUPS* 


Factors | Manic Depressive | Schizophrenic | Involutional Depression | Psychoneurosis | Psychoneurosis Mixed 

Number 
No. Rehospitalized 
% < 4 Rehosp. 
% ≥ 4 Rehosp. 

(the cell values are illegible) 

*One case has been removed from this table because of its double counting, reducing the N 
from 101 in the first study to 99 in this study. 

One further comment seems deserved. Among the schizophrenics with four or 
five points, four out of ten patients were rehospitalized; among those schizophrenics 
with six points, only one of six cases was rehospitalized. In this group, bizarreness 
of the symptom picture seems to be associated with good outcome. Unfortunately, 
the diagnoses on rehospitalization are not known. Such knowledge would provide a 
better test of Peterson’s work. 


SUMMARY 


A follow-up study was done on five diagnostic samples of clinical cases to deter- 
mine whether Peterson’s scale utilizing MMPI profiles can be used to predict the 
incidence of rehospitalization. The results, utilizing median follow-up periods rang- 
ing from five years ten months to eleven years, indicate reliable predictive utility 
for the Peterson scale. 
NOTE ON A RAPID SCORING PROCEDURE FOR THE CARD FORM MMPI 
GEORGE T. WILCOX 
Adult Psychiatric Clinic, Dayton, Ohio 


A system for encoding significant response and scale on each MMPI card to 
permit scoring without tabulating has recently come to the writer’s attention. The 
bottom edges of the clinical cards are marked in such a way that when all the cards 
are inverted in the box and aligned against one side, the marks will form a series of 
columns, each aligned with a scale designation on the index cards. Significant res- 
ponse is indicated by color, and scale by position, of the marks. The raw scores for 
each clinical and validation scale plus the raw scores for one or more supplementary 
scales can then be counted up directly from the cards. 

The bottom portion of the front surfaces of the index cards marked “False” and 
“Cannot Say” is marked off into 12, 13 or 14 spaces, leaving ¼” at each end un- 
marked because of the corner notches on the clinical cards. The spaces are then 
labelled with the conventional abbreviations of the clinical and validation scales, 
plus one or two supplementary scales if desired. This labelling could be done in red 
ink on the “False” card and in blue ink on the “Cannot Say” card, to avoid confusion 
in scoring. 

Using a published analysis of the items in the scales, an ink mark about 
1/16” long is made on the bottom edge of each card listed for a particular scale, the 
mark aligned with the space allotted to that scale on the index cards. When the 
significant response is “true”, the mark is made in red ink. When it is “false”, blue 
or black ink is used. On the 5 Mf scale items differentially scored for testee’s sex, we 
made 3 closely spaced dots, the color of the center dot indicating the significant res- 
ponse for females, and the two outer dots’ color indicating the significant response 
for males. 

To score the record, one simply counts the number of blue marks on the cards in 
front of the “Cannot Say” card, and the number of red marks in front of the “False” 
card, making a cumulative total. This is done for each column. The totals are trans- 
ferred directly to the bottom of the profile chart. 
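
The counting rule just described amounts to tallying, for each scale, the cards on which the testee's response matches the marked (significant) response. A minimal sketch follows, under stated assumptions: the cards filed in front of the “False” index card are taken to be those answered “true”, and the cards in front of the “Cannot Say” card those answered “false”; the card identifiers and their markings are hypothetical and are not actual MMPI keying; the sex-keyed Mf dots are left out.

```python
from collections import Counter

def score_deck(marks, responses):
    """Tally raw scores per scale.

    marks:     {card_id: {scale: 'T' or 'F'}}, 'T' standing for a red mark
               (significant response "true"), 'F' for a blue mark ("false").
    responses: {card_id: 'T', 'F', or '?'} as sorted by the testee.
    """
    totals = Counter()
    for card_id, answer in responses.items():
        for scale, significant in marks[card_id].items():
            if answer == significant:   # a red mark in the "true" pile, or blue in the "false" pile
                totals[scale] += 1
    return dict(totals)

# Hypothetical markings and responses, for illustration only.
marks = {"A1": {"D": "T", "Pt": "T"}, "A2": {"D": "F"},
         "A3": {"Hs": "T"}, "A4": {"Pt": "F"}}
responses = {"A1": "T", "A2": "F", "A3": "F", "A4": "?"}
raw_scores = score_deck(marks, responses)
```

Each entry of `raw_scores` corresponds to one column total on the bottom edge of the deck.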

We have found that, with care in marking, and with no more than 13 or 14 
scales used, there is enough separation between columns of marks to minimize con- 
fusion while scoring. The marking is a laborious process which can be expected to 
take several days’ time. It is best performed by a person with considerable clerical 
aptitude. It is advisable to check the marking by scoring one record by both this 
and the conventional method. We have used this system for about a year. So far no 
patient has indicated awareness of the marks on the cards. 

Among the advantages of this system, the rapid scoring is outstanding. By this 
system, the card form MMPI can be scored more quickly than the group form. This, 
of course, frees the cards promptly for the next administration. Another advantage is 
that the responses that produced a deviant score on the profile can be picked out 
directly from the deck of cards for individual consideration. The major disadvantage 
is that there is no permanent itemized record of the testee’s responses. The only other 
disadvantage which has come to light so far is that one is limited to scoring for the 
scales that are coded. The seriousness of these drawbacks would vary with the 
situation in which the test is used. 
THE CLINICAL APPLICATION OF SEQUENTIAL ANALYSIS 
ARNOLD TREHUB AND ISIDOR W. SCHERER 


Veterans Administration Hospital, Northampton, Mass. 


INTRODUCTION 


Sequential analysis is a relatively recent statistical development (1, 2, 3, 4), and 
appears of particular interest in the clinical setting. In the clinic or hospital, informa- 
tion of considerable potential value is recorded each day and filed away with the 
hope that some day it will be disinterred and subjected to statistical assessment. 
When the number of cases has grown and it is decided to analyze the data, the task 
of searching the records, transcribing the pertinent information, and making the 
statistical analysis may be a formidable one. In sequential analysis, however, we 
have available an elegant technique which allows us to assess information in terms 
of statistical hypotheses as each case is recorded. One of the potentially most useful 
applications of sequential analysis to clinical problems is in the testing of proportions. 
It is this application which is presented here. This paper has been limited to the 
goal of indicating some of the possible clinical uses of sequential analysis and some 
of the advantages inherent in this technique. 


In setting up a plan of sequential analysis, alternate hypotheses must first be 
explicitly stated in terms of proportions. Conventional symbols for the alternate 
hypotheses are H0 and H1. If we are interested, for example, in the ratio of human 
movement responses to color responses given by hysterics on the Rorschach, we 
might set up alternate hypotheses as follows. Sum C > M occurs in 70% of the 
cases. Then H0 = .70. Sum C > M occurs in 80% of the cases. Then H1 = .80. 
After stating H0 and H1, we must decide what proportion of the time we are willing 
to reject H0 when in fact it is true, and what proportion of the time we are willing to 
accept H0 when in fact H1 is true. The statements of these two proportions are called 
α values and β values respectively. Having decided the values of H0, H1, α, and β, 
we may then draw on any standard graph paper two parallel decision lines, the slope, 
separation, and course of which are determined by solving at least two values for y at 
arbitrary x points in each of the following formulae: 


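$$y \log_e \frac{P_1}{P_0} + x \log_e \frac{1 - P_1}{1 - P_0} = \log_e \frac{1 - \beta}{\alpha} \qquad (1)$$


and 


$$y \log_e \frac{P_1}{P_0} + x \log_e \frac{1 - P_1}{1 - P_0} = \log_e \frac{\beta}{1 - \alpha} \qquad (2)$$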


where P1 is the proportion in hypothesis H1, and P0 is the proportion in hypothesis 
H0; y refers to points on the ordinate of the graph, while x refers to points on the 
abscissa. The line through coordinates computed from formula (1) will be called the 
“accept H1” line, and the line through coordinates computed from formula (2) will 
be called the “accept H0” line. Table 1 gives xy coordinates which can be used direct- 
ly to draw H1 and H0 lines at 10 percent intervals, and at the risks of α = .05, 
β = .05. 
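
Coordinates for the two decision lines can also be computed directly by solving formulae (1) and (2) for y at a chosen x. The sketch below is illustrative only; the function name and the choice of P0 = .10, P1 = .20 and of x = 20 and 40 are assumptions made for the example.

```python
import math

def decision_lines(p0, p1, alpha, beta, x):
    """Return (y on the "accept H1" line, y on the "accept H0" line) at abscissa x."""
    y_coefficient = math.log(p1 / p0)               # multiplies y in formulae (1) and (2)
    x_coefficient = math.log((1 - p1) / (1 - p0))   # multiplies x in formulae (1) and (2)
    y_accept_h1 = (math.log((1 - beta) / alpha) - x * x_coefficient) / y_coefficient
    y_accept_h0 = (math.log(beta / (1 - alpha)) - x * x_coefficient) / y_coefficient
    return y_accept_h1, y_accept_h0

# Two points on each line are enough to draw it.
points = [decision_lines(0.10, 0.20, 0.05, 0.05, x) for x in (20, 40)]
```

Because both formulae share the same left-hand side, the vertical distance between the two lines is the same at every x, which is why the lines are parallel.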


1For a more general and theoretical presentation of sequential analysis and related concepts such 
as operating characteristics and average sample number, the reader is referred to the publications 
cited above, in particular the Columbia University Group publication and Wald. 
TABLE 1. VALUES OF y AT ARBITRARY VALUES OF x FOR ALTERNATE HYPOTHESES 
(α = .05, β = .05). 

Columns: Alternate Hypotheses (H1 : P1, H0 : P0); Arbitrary Values of x (legible headings: 10, 20) 
Legible entries, in order of appearance: 7.6, 11.0, -.8, 2.6, 13.9, 20.5, 28.5, -9.8 
(the remaining cells are illegible) 


HYPOTHETICAL APPLICATIONS 


Following are some examples of the possible application of sequential analysis 
to typical clinical problems. 


1. It is known that Treatment A results in a clinical improvement rate of 60 
percent for patients diagnosed as paranoid schizophrenics. A new treatment B has 
been suggested. It has been decided that if the hypothesis that treatment B is 60 
percent effective can be rejected in favor of the alternate hypothesis that treatment 
B is 70 percent effective at an acceptable level of confidence, B will be adopted as the 
treatment of choice. In this case, the following sequential analysis plan might be 
set up. 

a. Let H1 and H0 refer to the alternate hypotheses of improvement rate 
with treatment B. Thus, H1 : P1 = .70, and H0 : P0 = .60. 


b. Let risk α = .05 and risk β = .05. 


On the basis of values chosen for H1, H0, α, and β, two decision lines are drawn 
(see Table 1, Fig. 1). Each patient diagnosed as paranoid schizophrenic who com- 
pletes treatment B is classified as improved or unimproved. Let us assign the num- 
ber 1 to an improved case, and 0 to an unimproved case. For each 1 recorded, a line 
is drawn one unit in the vertical direction within the channel formed by the “accept 
H1” line and the “accept H0” line. For each 0 recorded, a line is drawn one unit in 
the horizontal direction. Each successive line starts at the point reached by the 
preceding entry. As long as the entries remain within the channel, classification and 
plotting of cases continues. When either decision line is crossed by an entry, samp- 
ling stops and the hypothesis represented by the line crossed is accepted. Now if the 
data come in and are classified in the following sequence: 
1110111111011100111111111011111111, we can see from Fig. 1 that H1 (70 percent 
improvement) is accepted as more likely (α = .05) than H0 (60 percent improvement). 
In this instance, treatment B would be adopted as the treatment of choice. 
While in this example H0 is based upon the proportion of improvement resulting 
from an established treatment, in the general case H0 could, of course, be set by an 
obtained proportion in any relevant control group. 
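
The plotting procedure of the first example can be mimicked arithmetically: each improved case adds log_e(P1/P0) to a running total, each unimproved case adds log_e((1 - P1)/(1 - P0)), and the total is compared with the right-hand sides of formulae (1) and (2). This is a sketch under the assumption that crossing a decision line on the graph is equivalent to the running total reaching the corresponding boundary; the function and variable names are hypothetical.

```python
import math

def sequential_test(outcomes, p0, p1, alpha, beta):
    """Process cases one at a time ('1' improved, '0' unimproved).

    Returns the decision and the number of cases used to reach it.
    """
    accept_h1_at = math.log((1 - beta) / alpha)    # right-hand side of formula (1)
    accept_h0_at = math.log(beta / (1 - alpha))    # right-hand side of formula (2)
    step_improved = math.log(p1 / p0)
    step_unimproved = math.log((1 - p1) / (1 - p0))
    total = 0.0
    for n, outcome in enumerate(outcomes, start=1):
        total += step_improved if outcome == "1" else step_unimproved
        if total >= accept_h1_at:
            return "accept H1", n
        if total <= accept_h0_at:
            return "accept H0", n
    return "continue sampling", len(outcomes)

# The sequence of the first example, entered in the two pieces in which it is printed.
sequence = "11101111110111001111111" + "11011111111"
decision = sequential_test(sequence, p0=0.60, p1=0.70, alpha=0.05, beta=0.05)
```

Under these assumptions the “accept H1” boundary is first reached on the thirty-fourth and last case of the sequence, which agrees with the outcome stated for the first example.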





2. Suppose our problem is not one of choosing between two treatments, but of 
deciding just how effective a given treatment is. To answer this question, one might 
set up several channels each successively more exacting in respect to the percent of 
improvement tested in the alternate hypotheses. The incoming data would then be 
plotted in each channel and the channel with the highest H1 accepted would consti- 
tute a measure of the effectiveness of the treatment. As a matter of convenience, the 
several channels at the desired intervals could be superimposed on the same graph as 
in Figure 2. Sampling and plotting would continue until an H1 and an H0 are ac- 
cepted in adjacent channels, or until a limiting P value is accepted. 

In the previous example, the hypothesis that treatment B results in 60 percent 
improvement was rejected in favor of the alternate hypothesis that treatment B re- 
sults in 70 percent improvement. At this point, sampling was discontinued because 
the question of accepting treatment B as the treatment of choice was answered. If 
now a channel were to be based on H1 : P1 = .80 and H0 : P0 = .70, and sampling 
continued from the point at which the first decision was reached, it is possible that 
the 70 percent hypothesis (H0) would be rejected in favor of the 80 percent hy- 
pothesis (H1). But then, if we continued plotting within a channel testing H1 : P1 = 
.90 against H0 : P0 = .80, we might find that H1 : P1 = .90 is rejected in favor of 
H0 : P0 = .80 (see Figure 2). If assessment intervals employed above were adequate 
for our purposes, we would at this point discontinue further testing having accepted 
the hypothesis of 80% improvement in the channel where H1 : P1 = .80 and H0 : P0 
= .70, and having rejected the hypothesis of 90% improvement in the channel 
where H1 : P1 = .90 and H0 : P0 = .80. 
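
The use of several superimposed channels can be sketched in the same arithmetic form. The example below is an illustration under assumptions, not the authors' procedure: every channel is fed the whole record from the first case onward (rather than from the point at which an earlier decision was reached), the record of 100 cases is hypothetical, and the names are invented for the example.

```python
import math

def run_channel(outcomes, p0, p1, alpha=0.05, beta=0.05):
    """Enter outcomes ('1' improved, '0' unimproved) in one channel.

    Returns (decision, n), n being the case at which a decision line was crossed.
    """
    accept_h1_at = math.log((1 - beta) / alpha)
    accept_h0_at = math.log(beta / (1 - alpha))
    step_improved = math.log(p1 / p0)
    step_unimproved = math.log((1 - p1) / (1 - p0))
    total = 0.0
    for n, outcome in enumerate(outcomes, start=1):
        total += step_improved if outcome == "1" else step_unimproved
        if total >= accept_h1_at:
            return "accept H1", n
        if total <= accept_h0_at:
            return "accept H0", n
    return "undecided", len(outcomes)

# Hypothetical record: 100 cases in which every fifth case is unimproved.
cases = "11110" * 20
channels = [(0.60, 0.70), (0.70, 0.80), (0.80, 0.90)]   # (P0, P1) for each channel
results = {channel: run_channel(cases, *channel) for channel in channels}

# The highest H1 accepted serves as the estimate of effectiveness.
accepted = [p1 for (p0, p1), (decision, _) in results.items() if decision == "accept H1"]
estimate = max(accepted) if accepted else None
```

With this hypothetical record, H1 is accepted in the first two channels and H0 in the third, so an H1 and an H0 are accepted in adjacent channels and the estimate is 80 percent improvement.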


3Operating characteristics have not yet been derived for decisions based upon such 
channel analyses. Thus in the applications described in the second and third examples, exact α and β 
values cannot be attached to the final decision. These procedures nevertheless provide useful esti- 
mates. 
3. A test index has been devised which is purported to discriminate schizo- 
phrenic from non-schizophrenic patients. In order to determine how well the index 
will discriminate in a given clinical setting, a set of superimposed sequential analysis 
channels can be used as in example 2. Sampling would be undertaken according to a 
scheme that would reflect normal base rates for the various diagnostic groups. The 
direction of each channel entry would depend upon the correspondence of the test 
index classification (schizophrenic, non-schizophrenic in this case) with the criterion 
classification for each sampled patient. Each correct match would be entered as a 
unit advance in the vertical direction, while each non-corresponding pair would be 
entered as a unit advance in the horizontal direction. The power of discrimination of 
the test index would be reflected in the channel with the highest H1 accepted. 

4. An intelligence test has been introduced which can be administered and 
scored in a fraction of the time required for standard tests. As one approach in 
assessing the new test, we wish to find out how well it will predict achievement on the 
Wechsler-Bellevue. In this case, two aspects of the power of prediction might be con- 
sidered: (1) the proportion of cases correctly predicted, and (2) the range of tolerance 
within which a given prediction will be considered correct. Let us say that we de- 
mand of the new intelligence test a correct prediction in more than 80 percent of the 
individuals tested. The criterion of how well the new test will predict Wechsler- 
Bellevue achievement would then be the minimum range of tolerance which will 
result in greater than 80 per cent correct prediction. Achievement on the new test 
might be plotted against achievement on the Wechsler, and several tolerance bands 
drawn about the principal diagonal, e.g., ± two IQ points, ± four IQ points, etc. 
Thus the proportion of correct predictions will decrease as the tolerance decreases. 
Correct and incorrect predictions for each tolerance band would be identified and 
entered separately in a sequential channel where H0 = .80. The minimum tolerance 
which yields an “accept H1” decision would constitute a measure of how well the new 
test predicts achievement on the standard test. 


COMMENT 
The following advantages of the sequential analysis method should be noted. 


1. When sequential analysis is used in testing percentages, it deals directly 
with the classification of individual cases. This is of particular importance in the 
clinical situation where the percentage of mistakes in the disposition of individuals 
is of prime concern. 

2. A sequential channel once computed may be used for any number of prob- 
lems requiring decisions within the same statistical limits. Thus as various types of 
analyses are made, a growing library of readily accessible pre-computed channels will 
be built for use with future problems. 

3. Once sequential channels have been computed, the task of classifying in- 
dividuals according to psychological test results, diagnosis, response to treatment, 
etc., as well as making channel entries, can usually be performed by clerical personnel. 

4. With sequential analysis, statistical testing of research data is an ongoing 
process which can keep pace with the influx of pertinent cases and which signals the 
stop of sampling at precisely the number of cases required to yield a decision in 
respect to the hypotheses tested. While the use of this method with incoming data 
has been emphasized, it is, of course, also possible to test hypotheses by sampling 
from existing files. 

5. For the hypotheses and error risks stated, sequential analysis will yield a 
decision with fewer cases on the average than other techniques will require. 
AN APPROXIMATE TABLE FOR DETERMINING SAMPLE SIZE 
BY MEANS OF THE SIGN TEST 


MOSS L. RAWN 


Creedmoor Institute for Psychobiologic Studies 
Queens Village, N. Y. 


When the form of a distribution is not known or is suspected to be other than 
normal, a conservative and approximate test for significance may be made with the 
sign test for which the only underlying assumptions “are that the differences are 
continuously distributed and independent” (1). It may also be used for normal 
distributions although more precise methods are preferable. For those experimental 
purposes where a quick and approximate way of determining sample size is needed 
for a given number of positive agreements, the sign test may be employed. Walker 
and Lev regrettably do not make available a table for ready, easy reference, but one 
may be constructed using their sign test formula (1). Solving for m, the larger part 
of the N sample (the number of positive agreements), with P at .05, yields: 


$$m = \frac{1.96\sqrt{N} + N + 1}{2}$$


No calculations from the formula are needed to use table 1 except for interpolated 
values. 


TABLE 1. TWO TAIL SIGN TEST AT P OF .05 FOR SAMPLE SIZE AND 
NUMBER OF POSITIVE AGREEMENTS1 

Columns: N, m, p% 
Legible entries include N = 41, m = 28; N = 42, m = 28; N = 43, m = 29; 
N = 40,000, m = 20,197; N = 1,000,000, m = 500,981 
(the remaining cells are illegible) 

1N is sample size. 
m, the larger part of N, is the number of positive agreements rounded off to the next highest 
integer. 
p% is equal to unrounded m/N × 100. Minor discrepancies would exist, because of rounding, 
between table p% values and less accurate p% values that would derive from the listed m 
rounded values. 

* A minor correction may be subtracted, although this is hardly necessary. See text for comment. 


Frequently the question arises about size of sample needed for significance with 
a given proportion of cases. One may wish to know, for example, how large a sample 
of shoppers is needed for significance at .05 with 65% of the sample expected to be 
women. In the p% column 64.86 is found* to correspond to an m of 33 and an N of 
50. Therefore 50 cases would be required to anticipate 65% of them to be women. 

Another type of question is as follows: if 150 patients received medication, how 
many should show improvement to defeat the null hypothesis at .05? The table 
shows that 88 persons would have to improve for significance to be demonstrated 
(for more precise purposes, the corresponding p% figure 58.34 may be used). 
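
The two illustrations above can be checked by evaluating the formula m = (1.96√N + N + 1)/2 directly, 1.96 being the two-tailed normal deviate for P = .05. The sketch below is offered only as a convenience for interpolated values; the function name is hypothetical, and rounding m up to the next highest integer follows the table footnote.

```python
import math

def sign_test_m(n, z=1.96):
    """Return (unrounded m, m rounded up to the next integer, p% = 100 * m / N)."""
    m = (z * math.sqrt(n) + n + 1) / 2
    return m, math.ceil(m), 100 * m / n

shoppers = sign_test_m(50)    # the sample of shoppers discussed above
patients = sign_test_m(150)   # the 150 medicated patients discussed above
```

For N = 50 this gives a rounded m of 33 with p% close to 64.86, and for N = 150 a rounded m of 88 with p% close to the 58.34 quoted in the text.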

Use of the sign test for less than 25 cases instead of the generally recommended (1) 
cumulative binomial probability distribution, results in very little loss of precision. 
This loss is in a conservative direction, viz., sign test calls for larger m values to ac- 
cord significance than does the binomial approach. Accordingly, in the table for the 
items marked with an asterisk N of 14 and 17, the corresponding sign test table 
rounded m values 12 and 14 should be reduced to the binomial values 11 and 13 res- 
pectively. All other table m rounded values need no correction. The unrounded p% 
figures found in the table, however, do not correspond exactly to binomial values. 
For rigorous purposes, the latter approach should be used for less than 25 cases, 
although under ordinary circumstances the sign test should serve these small 
samples well enough. 


REFERENCE 
1. Walker, H. and Lev, J. Statistical Inference. New York: Henry Holt, 1953. 


*Rounding always to next highest integer or 65. 
CONSTRUCTION OF A TABLE OF THE SIGNIFICANCE OF THE 
DIFFERENCE BETWEEN VERBAL AND PERFORMANCE 

PROBLEM AND METHOD 


Clinicians often assign significance to a wide discrepancy between Verbal and 
Performance IQ on the Wechsler Scales. However, there appears to be lack of agree- 
ment as to the discrepancy necessary to warrant special consideration. It is felt that 
a table that eliminates cumbersome computations in this area and affords a rapid 
and objective inspection of significance values would be very helpful. The technique 
of Payne and Jones (1) was used to obtain a standard score Z and one-tailed and two- 
tailed probabilities for the differences between a Verbal and a Performance IQ. 

Table 1 gives these data for both the WAIS and the Wechsler-Bellevue. 


TABLE 1. SIGNIFICANCE OF THE DIFFERENCE BETWEEN VERBAL IQ AND PERFORMANCE IQ ON THE 
WAIS AND THE WECHSLER-BELLEVUE 

Column headings, as legible: Difference (d) between the Verbal and Performance IQ; 
Z Score; Probability of obtaining or exceeding d; Probability of either Verbal or 
Performance IQ (heading incomplete). Sub-headings: WAIS, W-B. 

Legible probability entries, paired as printed: 

.206  .242  .105  .121 
.152  .189 
.109  .147 
.075  .109 
.050  .081 
.032  .057 
.020  .040 
.012  .027 
.007  .017 
.004  .011 
.002  .007 
.001  .004 
.000  .002 
.000  .001 

Legible Z entries: 1.44, 1.64 (the remaining Z and d entries are illegible) 






Clinicians and research workers are cautioned to accept the values offered in the 
table as being of strictly statistical derivation and significance. The values for the 
Wechsler-Bellevue are less reliable than for the WAIS, because of the variations in 
means and standard deviations at different age levels. 


SUMMARY 
In order to rapidly and objectively determine the significance of the difference 
between an individual’s Verbal and Performance IQ on the WAIS and the Wechsler- 
Bellevue, a table was constructed listing Z scores and probability levels for a range 
of differences. 


REFERENCES 
1. Payne, R. W. and Jones, H. Gwynne. Statistics for the investigation of individual cases. 
J. clin. Psychol., 1957, 13, 115-121. 


1This study was completed when the author was a Clinical Psychology Trainee at the Norfolk 
(Nebraska) State Hospital. 
PROBLEM 

This is an exploratory study of a systematic situational testing technique to 
predict social behavior. The technique is based on Starer’s study which concluded 
that affective stimuli in the form of personal questions measurably affected subjects’ 
drawings of Bender Gestalt designs. Starer did not isolate a particular affect, nor 
did he control adequately for nonexperimental differences in the subjects’ capacities 
to improve in their drawing of the designs, but his study did seem to lend some degree of 
support to the hypothesis that personally threatening affect measurably affected 
subjects’ drawings of geometric designs. In order to further clarify this hypothesis 
and evaluate the use of a modified version of this technique to predict social behavior, 
the following hypotheses were tested: First, N-P patients will improve less in their 
drawing of a geometric design under the stimulus of a recorded voice of a relevant 
authority figure than will hospital aides, improvement being measured by decrease 
in time and improvement in quality from first to second performance. Second, 
rated cooperativeness of the subjects will be related to the amount of improvement 
in their drawing of the geometric designs under the stimulus of a recorded voice of a 
relevant authority. 

METHOD 

Two groups of fifteen male subjects each were studied. The groups differed on 
the criterion of behaviorally being presently able or unable to adjust to social living. 
Holding a job as a hospital aide in a VA General Medical or Surgical Hospital or 
being a psychiatric patient in the same hospital were considered prima facie evidence 
of falling in one group or the other. All of the aides in the hospital were ranked by 
the chief nurse on their cooperativeness with authority. The patients were rated on 
their cooperativeness with authority by 6 aides making independent ratings.
Measures of concordance and correlation revealed a surprising amount of agreement
between aides in their rankings. Separating the aides into two groups revealed a rho
for the rankings of .83. Five persons each from the upper, middle, and lower parts of
the rankings were selected as subjects.

Each subject was tested individually. They were presented 12 geometric de- 
signs, one at a time. Structurally, these designs were one basic design, composed of 
4 component forms, rotated in twelve different positions in order to have 12 presenta- 
tions. Immediately after drawing the designs, the subjects were given the following 
instructions: “That was very good. I would like you to draw these twelve designs
again, but this time as you draw them you will be listening to a record. As you listen
to the record, please draw the designs. Do you have any questions?” As the subject
drew the designs for the second time, a record was played on which the assistant
manager of the hospital made a statement regarding hospital discipline, a clerk in the 
hospital read a column from Life magazine about European nobility, and the arm of 
the record player was lifted from the record. There was a balanced presentation of 
these three stimulus conditions, so that each subject drew 4 designs under each 
condition. 

Time to complete each design was measured, and each design was scored as to 
quality. Each of the four component forms, the relationship between them, and the 
design as a whole were judged on a five point scale of very good to very bad. These 
ratings were summed to obtain a quality score for each design. A three week test- 
retest scoring reliability study revealed a Pearson product moment r of .93 for quality 
scores. Differences between first and second performances were calculated, and an 
analysis of variance design was used to test hypothesis 1 and correlational tech- 
niques were used to test the second hypothesis. 
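
The scoring scheme just described can be sketched as follows. This is only an illustration under stated assumptions: the three intermediate scale labels, the numeric coding (5 for very good down to 1 for very bad), and all function names are hypothetical, since the text gives only the end points of the five point scale and the six things judged.

```python
# Assumed coding of the five point scale; only the end points
# ("very good", "very bad") are named in the text.
RATING_POINTS = {
    "very bad": 1,
    "bad": 2,
    "fair": 3,
    "good": 4,
    "very good": 5,
}


def quality_score(ratings):
    """Sum six judgments: four component forms, their relationship,
    and the design as a whole."""
    if len(ratings) != 6:
        raise ValueError("expected six judgments per design")
    return sum(RATING_POINTS[r] for r in ratings)


def improvement(first_quality, second_quality, first_time, second_time):
    """Change from first to second performance.

    Positive values mean better quality and less time, respectively.
    """
    return {
        "quality_gain": second_quality - first_quality,
        "time_reduction": first_time - second_time,
    }


# Hypothetical ratings and times (in seconds) for one design.
first = quality_score(["good", "good", "fair", "very good", "good", "good"])
second = quality_score(["very good"] * 6)
print(improvement(first, second, first_time=40.0, second_time=31.5))
```

Under this assumed coding a design can score between 6 and 30 points.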


*I would like to thank M. W. Gasper, M.D., Manager, Miss O. L. Hufcut, Chief Nurse, and
members of the N-P and Special Services of the Wilmington VA Hospital for their cooperation in this 
study. 
RESULTS
The data in Table 1 indicate that the patient group was more heterogeneous 
with regard to the quality of their initial drawings and the time needed to complete 
these drawings than was the aide group. Apparently, the present technique can 
measurably discriminate between the two groups. 


TABLE 1. DIFFERENCES BETWEEN PATIENT AND AIDE GROUPS WITH REGARD
TO THE [illegible] QUALITY AND TIME SCORES ON THE FIRST PERFORMANCE

Factors    Group       Mean      [heading illegible]

Quality    Aides       28.71     10.96
           Patients    27.13     26.62

Time       Aides       33.4      137.62
           Patients    43.37     1020.53

*Significant at .05 level
**Significant at .01 level


DISCUSSION

TABLE 2. F RATIOS FOR PATIENT AND AIDE GROUPS, VOICE OF AUTHORITY,
NEUTRAL VOICE, AND NO VOICE CONDITIONS, AND INTERACTION, BASED UPON
QUALITY AND TIME SCORES

Sources of variation: Subjects, Voices, Interaction. (The F ratios and the
required F at the .05 level are not legible in this copy.)

The data in Table 2 seem to indicate that hypothesis 1 may be rejected. There
was no apparent difference between N-P and aide groups with regard to their
improvement in quality and time of drawing the designs. There were no differences
between voice of authority, neutral voice, and no voice stimulus conditions in this
regard, nor were there apparent interaction effects between these stimulus conditions
and the two groups of subjects. Correlations of .72 and .47 were obtained between
patients’ ranking in cooperativeness and reduction in time needed to complete the
designs under the stimulus of the voice of authority and the more neutral voice,
respectively. Apparently, the more cooperative the patient was judged, the less he
decreased in time needed to draw the designs from first to second performance. This
does not seem to indicate a difference between the two stimulus conditions, however,
for there was no statistical difference between these two correlations; t for the
corresponding z’s is .97.
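
The comparison of the two correlations can be sketched with Fisher's r-to-z transformation. The computation below is an assumption about how the reported value was reached: it treats the two correlations as if they came from independent samples of 15 patients each, which appears to reproduce the reported ratio of .97, although the paper does not spell out its formula. The function name is hypothetical.

```python
import math


def fisher_z_ratio(r1, n1, r2, n2):
    """Ratio of the difference between two Fisher z values to its
    standard error, treating the two correlations as independent."""
    z1 = math.atanh(r1)  # Fisher r-to-z transformation
    z2 = math.atanh(r2)
    se_diff = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    return (z1 - z2) / se_diff


# .72 (voice of authority) versus .47 (neutral voice), 15 patients.
ratio = fisher_z_ratio(0.72, 15, 0.47, 15)
print(f"{ratio:.2f}")
```

Because both correlations were obtained on the same 15 patients, a test for dependent correlations would strictly be more appropriate; the independent-samples form is shown only because it matches the figure in the text.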

The data seem to indicate that hypothesis 2 may be partly accepted. Although 
the correlations between judged cooperativeness and improvement in quality all 
seemed to be insignificant, the correlation of .72 discussed above suggests that the 
greater the judged cooperativeness of the patients the less they decreased in time 
necessary to draw the designs under the stimulus of the voice of the assistant manager 
from first to second performance. An r of -.38 for this same relationship with aides,
although not statistically significant, suggests an opposite tendency with the aide 
group. It should be noted here that judged cooperativeness with the aides was de- 
termined by the ranking of the supervisory nurse who had less day to day contact 
with the aides than did the 6 aides who ranked the patients. 
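
The significance statements about the two correlations can be checked with the usual t test for a correlation coefficient. This is a sketch under assumptions: the paper does not say which test was used, the sample size of 15 per group is taken from the Method section, and the function name is hypothetical. The critical value 2.160 is the standard two-tailed .05 value of t for 13 degrees of freedom.

```python
import math


def t_for_r(r, n):
    """t statistic for testing a correlation against zero (df = n - 2)."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)


CRITICAL_T_05 = 2.160  # two-tailed .05 critical value for df = 13

for label, r in (("patients", 0.72), ("aides", -0.38)):
    t = t_for_r(r, n=15)
    verdict = "significant" if abs(t) > CRITICAL_T_05 else "not significant"
    print(f"{label}: r = {r:+.2f}, t(13) = {t:.2f}, {verdict} at .05")
```

On these assumptions the patients' correlation exceeds the critical value and the aides' correlation does not, which agrees with the description in the text.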

The present study seems to reveal a lack of potency of the independent variable,
a record of the voice of a particular assistant manager, as well as the probability
that the two groups, differing in their distribution of time and quality scores in
initially drawing the designs, had comparable capacities to improve in their drawing
of the designs on the second trial. The present finding of equivalence between the
three stimulus conditions, plus the significant relationship found between patients’
judged cooperativeness and the decrease in time needed to draw the designs between
first and second trials, seems to suggest that a similar analysis of variance design, using
a more potent independent variable, might differentiate important relationships
which might aid in the prediction of social behavior.


SUMMARY 

A group of N-P patients and a group of hospital aides drew 12 geometric designs. 
Immediately after their initial drawings, they drew the designs again under a bal- 
anced presentation of three stimulus conditions. A recording of the assistant manager 
of the hospital making a statement regarding hospital discipline, a recording of a 
clerk reading a passage from Life magazine, and no voice presentation were the 
three stimulus conditions. Amount of change in quality and time from first to second 
performance were the dependent variables. 

An analysis of variance design revealed that there were no differences between 
the N-P and aide groups with respect to their improvement in their drawing of 
geometric designs. A product moment correlation of .72 suggested that the greater 
the judged cooperativeness of the patients, the less they decreased in time necessary 
to draw the designs under the stimulus of the voice of the assistant manager from 
first to second performance. The r of -.38 for this same relationship with aides,
although not statistically significant, suggests an opposite tendency with the aide
group. 

REFERENCES 
1. Starer, E. The effects of two simultaneous cognitive and affective stimuli. J. clin. Psychol.,
1952, 8, 402-405. 


A NOTE ON THE RELIABILITY OF THE L-M FERGUS FALLS 
BEHAVIOR RATING SCALE 


LLOYD K. SINES 
Fergus Falls State Hospital, Minnesota 


INTRODUCTION 

The L-M Fergus Falls Behavior Rating Scale has been used as a criterion
measure in a number of published studies dealing with the effects of specific therapies
on various psychiatric samples. It is obvious that the efficacy of this procedure
depends upon the reliability and validity of the rating scale itself. Although both of
these aspects of the scale were dealt with in the earlier reports, the present paper
deals with certain limitations in the use of the scale because of the characteristics of
subscale reliability.

PROCEDURE 

Sixty-seven psychiatric patients of varying diagnoses (preponderantly schizophrenia)
who were currently in some phase of a total push program at the Fergus
Falls State Hospital were rated on the L-M scale by four psychiatric aides who were
well acquainted with them. The L-M ratings were done in the usual manner, none of
the raters communicating with any of the others in regard to specific patients or
ratings. Of the total group of patients, 33 were male and 34 were female. Two male
aides rated the male patients, while two female aides rated the female patients. Area
F on the scale (Response to Electric or Insulin Therapy) was not rated, since none of
the patients had received either therapy during the period on which the ratings were
based.
RESULTS


Several measures of reliability were computed. First, since for each patient two
L-M ratings were obtained, an inter-rater reliability coefficient was obtained. That is,
aides A and B rated (male) patients 1 through 33, from which r_AB was obtained by
correlating the two mean L-M scores for X_1 (patient number one) through X_33
(patient number 33). Similarly, aides C and D rated (female) patients 34 through 67,
from which r_CD was obtained by correlating mean L-M scores for X_34 through X_67.
Finally, the male and female groups were combined and an overall reliability
coefficient (r_11) was computed between the 67 L-M means obtained from the four
raters (Table 1).
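
The inter-rater coefficients described above are ordinary product-moment correlations between two raters' mean scores for the same patients. A minimal sketch follows; the five pairs of mean scores are invented for illustration and are not data from the study, and the function name is hypothetical.

```python
import math


def pearson_r(x, y):
    """Product-moment correlation between two equal-length score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length (at least 2 items)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sum_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sum_xx = sum((a - mean_x) ** 2 for a in x)
    sum_yy = sum((b - mean_y) ** 2 for b in y)
    return sum_xy / math.sqrt(sum_xx * sum_yy)


# Hypothetical mean L-M scores given to the same five patients by two raters.
rater_a = [2.1, 3.4, 2.8, 3.9, 1.7]
rater_b = [2.3, 3.1, 2.9, 3.6, 2.0]
print(f"inter-rater r = {pearson_r(rater_a, rater_b):.3f}")
```

The same function would apply to a single subscale by passing each rater's ratings on that area instead of the mean scores.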


TABLE 1. PARAMETER VALUES AND INTER-RATER RELIABILITY COEFFICIENTS
OBTAINED BY CORRELATING MEAN L-M SCORES ON A SAMPLE OF 67
INTENSIVE TREATMENT SERVICE PATIENTS

                        Average L-M Scores      Inter-rater
Group           N       Mean       S.D.         reliability

Male            33      2.97       .608         [illegible]
Female          34      2.89       .800         [illegible]
Total group     67      2.93       .714         [illegible]

As a further estimate of inter-rater reliability, the per cent of agreement between 
raters on subscale ratings was computed (Table 2). It may be seen from Table 2 that 
in 92 per cent of the ratings the two raters either were in absolute agreement or differ- 
ed in their ratings by only one point. 
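
The per cent of agreement can be sketched as a tally of absolute discrepancies between paired ratings. The ratings below are invented for illustration and are not the study's data; the function names are hypothetical.

```python
from collections import Counter


def discrepancy_table(ratings_a, ratings_b):
    """Map each absolute discrepancy to (count, per cent of all ratings)."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("need two non-empty lists of equal length")
    counts = Counter(abs(a - b) for a, b in zip(ratings_a, ratings_b))
    total = len(ratings_a)
    return {d: (n, 100.0 * n / total) for d, n in sorted(counts.items())}


def pct_within(ratings_a, ratings_b, tolerance=1):
    """Per cent of paired ratings differing by no more than `tolerance`."""
    table = discrepancy_table(ratings_a, ratings_b)
    return sum(pct for d, (_, pct) in table.items() if d <= tolerance)


# Hypothetical subscale ratings of the same ten items by two raters.
rater_a = [1, 2, 3, 4, 5, 3, 2, 4, 1, 5]
rater_b = [1, 3, 3, 2, 5, 4, 2, 4, 4, 5]
for discrepancy, (n, pct) in discrepancy_table(rater_a, rater_b).items():
    print(f"discrepancy {discrepancy}: N = {n}, {pct:.0f}%")
print(f"within one point: {pct_within(rater_a, rater_b):.0f}%")
```

The figure of 92 per cent quoted in the text corresponds to the last quantity, computed over all subscale ratings.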

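TABLE 2. THE PER CENT OF AGREEMENT BETWEEN RATERS AND THE NUMBER
OF CASES IN WHICH DISCREPANCIES OF VARYING AMOUNTS OBTAINED
BETWEEN THE TWO RATERS

Discrepancy between raters      N       %

[The row labels and percentages are illegible. The surviving N entries, in order,
are 233, 43, 9, and 0, followed by 658.]
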

The values reported in Tables 1 and 2 compare favorably with those reported in
the original article on the scale and indicate that the inter-rater agreement, in
terms of total L-M score and per cent of agreement on actual ratings, is adequate to
the demands of most clinical-type research in which the scale is used. However,
demonstrating the reliability of total or mean scores does little in the way of
establishing the degree of reliability of the various subscales or areas included in the scale
as a whole. Since it has been suggested that pre-post therapy analyses of subscale
scores may elucidate specific areas of change in patient behavior, a determination of
the reliability of the individual scales is in order. The following procedures were
carried out in obtaining the pertinent values. Each of the ten L-M areas (subscales)
rated was considered individually and inter-rater reliability coefficients computed.
Thus r_AB, r_CD, and r_11 were obtained for each of areas A through K, with the
exception of area F (Table 3). From Table 3 it is apparent that the ten subscales differ
with respect to reliability, and with possibly one or two exceptions, none approaches
the level of reliability usually demanded of complete tests.
DISCUSSION


The apparent lack of high subscale reliability precludes the feasibility of using 
specific L-M items or areas as tests by themselves. Further, pre-post therapy an- 
alyses of specific areas on the scale are hazardous in view of the relatively low re- 
liability of several of the subscales. For example, apparent changes in psychomotor 
activity (area I) ratings after therapy could arise from the low reliability of that scale
as well as from an actual effect of the therapeutic procedure on the patient’s psycho- 
motor behavior. 

Although the subscale reliability coefficients range from .513 to .772 and the 
overall (A through K) reliability of the subscales is but .653, the substantially higher 
reliability of total or mean L-M scores indicates that the scale may be used as a 
whole with assurance that observed changes in the latter are reliable within the limits 
generally accepted. Finally, in view of the demonstrated validity of the scale, it
may be said that significant changes in total or mean L-M scores reflect actual changes 
in the behavior of the patients rated. 


SUMMARY 


Sixty-seven psychiatric patients on an intensive treatment service of a state
hospital were rated by four psychiatric aides on the L-M Fergus Falls Behavior 
Rating Scale. Various measures of reliability were computed on both total or mean 
L-M scores and the actual subscale ratings. It was found that while mean L-M 
scores have adequate inter-rater reliability, the several subscales differ with respect 
to reliability and none approaches the usual standards of adequate test or complete 
scale reliability. It was concluded that while the L-M scale can be used as a whole, 
treating total or mean score as a dependent variable, subscales should not be used 
individually, nor should specific “item analyses” be made to determine the actual
areas of change in patient behavior as a result of significant total score changes. It is
possible only to infer changes in “gross observable behavior” from significant L-M
total score changes rather than to delineate with high accuracy specific areas of
behavior change.
THEORY VERSUS PRACTICE IN CLINICAL PSYCHOLOGY


There is a wide discrepancy in clinical psychology between theory and practice, 
between what we believe and what we do. Our tests, for instance, are at variance 
with our general theory of the nature of maladjustment. We state convincingly that 
the problem resides not in individuals, in parents or in children, but in a faulty re- 
lationship between the two. But our psychological tests were devised to learn what 
is wrong with the individual, thus placing the locus of the problem in him. What we 
need is a whole new battery of tests, or new ways of interpreting the tests we have, to 
get at the faulty relationship rather than the faults of individuals. 

The search for the trouble in an individual turns on an antiquated theory of 
causality. There have been many criticisms of the traditional doctrine of cause and 
effect, but the one that is most pertinent here is the notion of passivity. In the mech- 
anical view, the cause was considered to be the active agent and the effect the pass- 
ive consequent. But in human relationships the so-called effect or consequent is as 
active as the so-called antecedent or agent, so that there is a reciprocal effect on each 
other and both persons are affected. Despite our disavowal of the old empirical view 
of causality in the practice of clinical psychology, by the use of tests we assume that 
the parents are the agent or the cause, and the child is the effect or the result of the 
interaction, which naturally exaggerates the role of the parent and minimizes the role
of the child. The formulation of a modern theory of causality consonant with human 
interaction is a desideratum. 

The prevalent attitude towards symptoms, as surface signs hiding the true cause 
or the real illness, is related to our notion of cause and effect. Illness, in clinical psy- 
chology, is an abstraction, and only the symptoms are real, for they are the mal- 
adjustment. The search for the true cause is a search for the non-existent. This 
erroneous kind of thinking may be due to the analogy to medicine where an illness 
may have definite causes and symptoms. Emotional problems are not an illness in 
the same sense; they are maladjustments due to maleducation. 

Another factor lies in our theory of personality. We constantly state that person- 
ality is not fixed or set like plaster, but something pliable and protean. Yet our tests 
assume that we have a veridical personality that can be understood by making a few 
soundings or plumbing of its homogeneous depth. While we admit that personality is
changing, we explore it as if it were stable rather than in a state of flux. We treat it 
as if it were a unitary thing, consistent and uniform, whereas it may be that its divers- 
ity is more important than its integration, and calls for a more suitable figure of 
speech that represents this construct. 

Another point turns on the two opposing pillars upon which our psychology is 
built. One of these pillars is the psychology of individual differences: everyone is 
fundamentally unique and different. This is a grand generalization, but it is also a 
generalization to end all further generalizations: it doesn’t make it possible to formu- 
late any laws or principles and limits clinical psychology to an art rather than a 
science. On the other hand, we also believe that everyone is fundamentally the same. 
Our differences are so slight that they can all be placed on the same continuum and 
form a normal frequency curve. This permits the statistical treatment of personality. 

The acceptance of these two diametrically opposed concepts without any at- 
tempt to harmonize them widens the gap between theory and practice, for it enables 
us to profess one position and practice the other. We are able to justify the use of 
either frame of reference for our theory or practice. Clinical psychology will con- 
tinue to suffer from a disparity between theory and practice until these two basal 
dicta are resolved. This task seems more important to the success of clinical psycho- 
logy than the verification of current theories or the formulation of new ones. 


SALVATORE Russo 
Wichita (Kansas) Guidance Center 
“BROAD SPECTRUM” CLINICAL PSYCHOLOGY


As applied to clinical practice, “broad spectrum” refers to methods which
encompass the entire range of factors related to etiology, diagnosis and therapy. In
contrast, “narrow spectrum” methods are directed toward only some limited range
or aspect of clinical phenomena. The concept recently has attained wide recognition 
in clinical medicine, particularly with reference to wide-range antibiotics and other 
drugs which are effective against a wide range of etiologic agents in contrast with 
earlier methods having only limited effectiveness. It is intended to emphasize the 
necessity of recognizing and dealing with a wide range of etiologic factors compre- 
hensively. 

There is evidence that not all viewpoints in clinical psychology are properly 
described as “broad spectrum” approaches. At the 1957 APA convention at New
York, there was considerable joking about colleagues who persist in playing one- 
stringed violins. Thus we have the EENT (eye-ear-nose-and-throat) psychologists 
who are still preoccupied with brass instrument studies of sensation. And the “rat”
psychologists or reflexologists who preach that all can be known from the beha- 
vioristic or neobehavioristic study of the laboratory animal—they paradoxically 
ignore the complexities of higher level phenomena such as interpersonal relations. 
And the dyed-in-the-wool experimentalists who sometimes sound as sad when they 
branch off into clinical speculations as do intuitive clinicians at play in the experi- 
mental laboratory—in fact we were positively embarrassed at the APA to hear what 
some noted experimentalists had to say about clinical matters when they wandered 
afield from their stated topics. And so on through adherents of all the other schools, 
each apparently deriving some inner gratifications from identifying with or creating 
some special theoretical position. 

It is difficult to see how a genuinely “broad spectrum” approach can be any- 
thing but thoroughly eclectic. “Broad spectrum” means more than the attempt to 
broaden out any one theoretical model until it can explain everything in the field. 
Is clinical psychology a separate specialty or profession, or is it simply one applica- 
tion of basic science psychology? Some experimentalists appear to be in the position 
of arguing that anyone with a solid background in basic science psychology, e.g. 
a solid experimental-statistical orientation, should be competent to practice in the 
applied fields which are regarded as simply practical applications of basic science. 
The argument seems to be that a good experimentalist has the basic tools to succeed 
in any specialty he attempts. Without denying the value of a sound experimental- 
statistical background, we very seriously doubt whether anything can replace years 
of experience and clinical practice with actual case materials in our field. 

If anything, the clinician has to be more “broad spectrum” eclectically-orient- 
ed than the experimentalist, statistician, comparative psychologist, psychophysiolo- 
gist or learning expert. Anyone who proposes to deal with the personality as a whole, 
studying the organism-meeting-its-environment, and dealing with the complexity 
of interaction taking place in psychosocial fields, must indeed be a master scientist, 
familiar with and competent in all the borderline specialties which may contribute to 
the problem. At a time when a genuinely “broad spectrum” approach to clinical
practice is possible, anyone who voluntarily limits himself to anything less broad 
must be regarded as not practicing according to the highest standards of time and 
place. The older generation of clinical psychologists may be partially excused from 
living up to “broad spectrum” qualifications of competence because of training
deficiencies of their era. But the younger generation has an obligation to live up to the
“broad spectrum” developments which are just over the horizon. The “broad 
spectrum” approach is essentially an attitude, a characteristic manner of thinking 
and operating in regard to data. 
Jones, Ernest. The Life and Work of Sigmund Freud. Vols. I, II, III. New York:
Basic Books, 1953, 1955, 1957. Vol. I, $6.75. Vol. II, $6.75. Vol. III, $7.50.


Perhaps never before have the fortuitous circumstances occurred making poss- 
ible an intensive biographic study of an authentic genius made by a distinguished 
psychoanalyst utilizing a stupendous amount of data gathered from life record, 
introspective reporting, objective examinations and projective sources. Freud, the 
founder of psychoanalysis, lived in a sort of psychoanalytic fish bowl in the sense 
that his students and followers repeatedly scrutinized their master in a succession of 
penetrating analyses, most of which have gotten into print. Probably it can be said 
that never before has any one person been so intensively studied by so many psycho- 
analytic experts as was Freud. While other geniuses have been venerated and written 
about, it is safe to say that never before has so much intimate evidence been collect- 
ed concerning the life and inner motivations of any one man. The result of all this 
intensive study is one of the best biographic studies ever written. This review must 
start by paying homage to Ernest Jones who was not only one of Freud's most 
staunch friends and followers, but who also must be granted recognition as a sterling 
scientist and writer in his own right. Having known Freud for thirty years, and 
labored with him unceasingly to advance the cause of psychoanalysis throughout the 
period, and finally being instrumental in securing refuge for Freud in England when 
he was endangered by the Nazi outrage, Jones may lay claim to having known the 
master as well as anyone not a member of the family ever could. Science will be 
eternally grateful to Jones for having undertaken the massive research and literary 
project necessary to collate so much material. 

No review could be capable of summarizing the numberless insights into Freud 
the man which are contained in these three volumes. These the reader can only get 
for himself by patient reading, and rereading, the fascinating story of how Freud 
slowly matured into the gigantic figure which he was. Here are only a few points 
which intrigued this reviewer. It appears that Freud as a small lad had some in- 
tuition concerning his own genius and great destiny for there are repeated examples 
when he seemed inspired from an early age by inner creative spasms from which 
original thoughts evolved. It appears that beginning with his mother, the quality 
and depth of his creative genius quickly made itself apparent and won many friends 
and supporters. Freud early showed an intensity of feeling and thought which re- 
peatedly manifested itself in many overt behaviors. The complexity and sensitivity 
of his personality were particularly revealed in the troubled years of courtship when 
he was still too poor and unsuccessful to get married. That these did not unsettle or 
distract him is a tribute to his own stability and the inner strength resulting from his 
personal analysis of himself. Freud was a prolific writer, and was written about
prolifically, so we have a rich abundance of data concerning what he seemed to be
externally and what went on introspectively. This reviewer constantly experienced
feelings of a deep understanding and admiration of Freud as a man on reading the 
intimate details of his life. In his own personal life, Freud came as close to being as 
healthy, mature, stable, responsible, dedicated, understanding, tolerant and creative 
as any mortal could possibly be. From the emotional turmoils of courtship, through 
the bitter travail of his creative scientific life, to the final heroic struggle with fatal 
illness, the personal life of Freud gives as convincing a picture of inner strength and 
self-consistency and decency as can be imagined. Freud had his conflicts, ambi- 
valences and impulsive moments, but these are tremendously overshadowed by the 
dedication to long range goals and sincere determination to do what was right in 
both personal and scientific life. Jones has patiently documented in great detail 
what actually went on during the critical periods of Freud’s life and it all makes 
fascinating reading. 
The story of how Freud patiently constructed the theory and practice of
psychoanalysis raises him to the rank of being one of the greatest empiricists in the history
of scientific discovery. Although Freud is known for his great deductive contribu- 
tions, his basic reliance was on the most painstaking and thorough study of raw 
behavior by inductive methods. His early training in neuropathology gave him a 
solid grounding in the experimental-objective approach to raw data, and he went far 
beyond just applying the popular theories of his era to the creation of entirely new 
ideas. One of his greatest abilities was a fine flair for recognizing the crucial import- 
ance of critical incidents which gave clues which lesser men ignored. Repeatedly, 
we find him recognizing the significance of relationships which others had just passed 
over. He kept asking himself, “What could this mean?” and then coming up with
original insights in a truly astounding profusion. The historical developments of the 
great discoveries of psychoanalysis are too well known to be repeated here. The 
truly unusual thing is the fact of such great productiveness in just one man. 

It has been said that no radical thinker is ever able to hold out in his reformist 
efforts into old age in the face of the cumulative weight of opposition and derogatory 
public opinion. The penalty of original thinking usually is to be ignored and ostra- 
cized by the entrenched political forces and dogmas of the day. Freud suffered more 
than most productive geniuses in having his discoveries ridiculed and derided by the 
mass of more orthodox thinkers. His theories concerning the sexual and emotional 
etiology of neurotic and normal behaviors outraged the conventional morality of his 
times. Indeed his treatment at the hands of the Viennese authorities is but another 
proof of the old axiom that no man can be an hero in his own town. Lesser men 
would have desisted and shriveled in the face of such powerful opposition but Freud 
persisted steadfastly in his duties as he saw them. One of the marks of a great man is 
to be unceasingly active, whether right or wrong; lesser men vegetate in the safety of 
passivity and conventionality. One of his rewards was to live to see the day when his 
great discoveries would be publicly recognized and heralded for their true worth. 

The critical reader will be interested to note that Freud himself remained hard- 
headedly critical of himself and the developments of his times to the end of his life. 
Never was there any weakness or deterioration, even in old age, in his determination 
to discover the truth and to validate his hypotheses and findings by every possible 
means. Freud himself was indubitably more critical than many of his followers have 
been, and he probably would unmistakably dissociate himself from many of the 
excesses which have characterized the later development of psychoanalysis. Even 
though its strongest propounder and defender, Freud himself was broader than 
psychoanalysis in that he repeatedly applied the law of parsimony and revised his 
own theories when he saw that new evidence demanded it. The breadth of Freud’s 
education was truly remarkable for his day, and his insights were far broader than 
those of many figures on the current scene. One may quarrel with points of
psychoanalytic theory but we cannot fail to recognize the soundness and depth of Freud
the man. 

One of the strong points of this biography is the mass of interpretive material 
which Jones has added at each point in the historical development of the story. 
Indeed, the student can learn about psychoanalysis from reading the life of Freud 
and how one development led to another. This reviewer feels that The Life and Work 
of Sigmund Freud must be required reading for every psychologist and psychiatrist. 
Next to having been intimate with Freud as a living person, the best thing is to
become thoroughly familiar with this monumental objective study of what he did
and was.
Pca 2s 
Cattell, Raymond B. Personality and Motivation Structure and Measurement.
Yonkers, N. Y.: World Book, 1957, pp. 948.


Dr. Cattell and his University of Illinois Laboratory of Personality Assessment 
and Group Behavior staff have accomplished a truly stupendous amount of work in 
personality study using factorial methods. The development of his theory, research 
program and practical test design has progressed so rapidly as to require frequent
extensive reviews of their theoretical and practical implications. The fact that Dr. 
Cattell is one of the most prolific and original thinkers on the current psychological 
scene renders it very difficult to write an authoritative review of all the concepts and 
data included in this compendious work which covers such a wide range of topics 
that not many contemporary psychologists are competent to comprehend the tech- 
nical details of all that is presented. Indeed, the whole field of factor analysis is so 
new and with so many unsettled problems, that the ultimate validity of much that 
Cattell has done must await the time when there can accumulate sufficient data and 
cross-validation experiments so that general agreements on basic issues can be reach- 
ed. Cattell’s sweep is so broad that his enthusiasm frequently leads him into relative- 
ly uncharted areas where speculations reach far beyond what has been actually 
validated. It will require patient checking in independent laboratories to determine 
whether Cattell’s results can be replicated and cross-validated. 

This reviewer does not wish to be guilty of using the device of slyly derogating 
a major work by citing a number of apparent inconsistencies or controversial points 
and thereby casting suspicion on the value of the work as a whole. Cattell’s work has 
not received the attention it deserves in American psychology, partly because its 
scope is beyond the comprehension of many of his critics and partly because Cattell’s 
enthusiasm to pioneer new areas has sometimes led him into premature conclusions 
which have been unfairly cited as characterizing his whole work. We suggest in all 
seriousness that the American Psychological Association would do well to
organize symposia and study groups to bring the best minds in the profession to 
bear on Cattell’s work in order to arrive at a better understanding of its ultimate 
validity. In the meantime, this advanced text is heartily recommended to those 
serious readers who have the patience and the background to encompass all that is 
presented. 


CORSINI, Raymond J. Methods of Group Psychotherapy. New York: McGraw-Hill,
1957, pp. 251. $6.50


The author is research associate at the Industrial Relations Center of the Uni- 
versity of Chicago and counselor at the Chicago Community Child Guidance Cent- 
ers. This is an eclectic textbook which objectively presents the current status of this 
new implement in the psychotherapeutic armamentarium. Part one consists of ten 
chapters discussing the history, nature, mechanisms, procedures, processes and appli- 
cations of group therapy. Part two is a case book presenting representative verba- 
tim recordings of analytic, nondirective, family counseling and psychodramatic 
group therapy. 


KARPMAN, Ben. The Hangover. Springfield, Ill.: C. C. Thomas, 1957, pp. 531. $9.50 


Dr. Karpman, one of America’s foremost authorities on problems of psycho- 
pathology, here approaches the psychological problems of alcoholism by a detailed 
study of the hangover. This is accomplished empirically and inductively by pre- 
sentation of a detailed series of case histories illustrating most of the known phen- 
omena of alcoholism. The orientation is psychoanalytic and the presentation so 
authoritative that this book will undoubtedly take its place as a basic source book 
on the subject. 


WOOLF, Maurice D. and WOOLF, Jeanne A. Remedial Reading. New York:
McGraw-Hill, 1957, pp. 424. $5.75


This is one of the best texts on remedial reading which we have seen. The prob- 
lem of remedial reading is approached within the broad frame of reference of person- 
ality as a whole, citing the importance of making the most detailed psychological 
studies of general personality development against which reading problems and 
other symptoms may be evaluated. Theory and practice are presented eclectically, 
with well organized illustrative materials. 











SPRING LAKE RANCH 


A Farm in the Green Mountains Where Men and Women 
Facing Difficult Life Problems May Build Health, 
Confidence and Courage. 


Spring Lake Ranch offers an experience in normal group living to men and 
women well enough to cooperate in their own behalf. It is no sanitarium, but the 
joint outgrowth of the faith, leadership and skills of founders, staff and guests. 


WHAT SPRING LAKE RANCH HAS TO OFFER—a free natural environment and
permissiveness that is unique and valuable; rural family life, all ages, both sexes;
a self-governing group in which guests help to plan and direct activities; a
realistic program of work and play, emphasizing the big muscles; a mature plan
leading to self-support as an incentive to adjustment. Formal psychotherapy
may be arranged for with Dr. Frederick C. Thorne, who is also available as
consultant on psychological and psychiatric problems.


WHAT SPRING LAKE RANCH DOES NOT OFFER—a “rest cure”, custodial care for
individuals unable to cooperate, resident medical or psychiatric care, full-time 
supervision. 

A LAYMAN’S EFFORT—From the start, leading psychiatrists have identified themselves
with this experiment, referring suitable patients, furnishing clinical
histories, advising the staff. This teamwork has made possible a center dedi- 
cated to mental health in a remote setting without a resident psychiatrist—a 
layman’s complement to psychiatry. 


REFERRALS—Ranch life is designed only for those whose constructive forces may be 
reached. Space being limited, we only accept those likely to improve. Results 
have been good with conditions of fear, anxiety, depression, obsession; pre-hos- 
pital and middle-age emotional problems; alcoholics seeking help; epileptics 
whose convulsions are under control; post-shock and selected post-lobotomy
patients.


DURATION OF STAY—“Ranchers” are in two categories: those who require a strong
pick-up and “pressure-off” for 1 to 3 months; and those who need rehabilitation
between hospital and home, 3 months to 2 years.


TRAINEES—The experience of 24 years has convinced us that this type of interim 
facility is needed throughout the United States. We are ready to receive as 
trainees qualified couples, to work with us a year or more with a view to starting 
such centres themselves. We offer a free, good life, a small salary, staff member 
status and the benefit of our total experience. 


WAYNE ANTHONY SARCKA        ELIZABETH MAN SARCKA
Cuttingsville, Vermont 











NEW AND UNIQUE - - - 


THE HANGOVER 


A Critical Study in the 
Psychodynamics of Alcoholism. 


By 
BENJAMIN KARPMAN, M.D. 
Chief Psychotherapist 
St. Elizabeths Hospital 
Washington, D. C. 


“In the entire set of reactions which
appear to follow the use of alcohol, 
none, it seems, is more remarkable 
than that which goes under the name 
‘HANGOVER.’” — From the Preface


A frank and penetrating study of the lives 
of seven men and seven women alcoholics 
in whom the “hangover” is shown to be 
symptomatic of the specific under- 
lying personality disorders which lead 
to alcoholism. Dr. Karpman’s study shows 
that analysis of the hangover yields val- 
uable clues as to the nature of the person- 
ality maladjustment. Individual differ- 
ences in the reaction to alcohol are clearly 
shown in the hangover period. 


REFLECTED IN THE HANGOVER 
ARE VIRTUALLY ALL PSYCHOLOG- 
ICAL ASPECTS OF ALCOHOLISM 
Much of the material is in the words of the 
men and women concerned and they dis- 
cuss freely the interplay of problems con- 
nected with family, sex, employment, and 
social life. The material presented will be 
valuable to workers in many fields of the 
social sciences as well as to psychiatrists. 
Unusual original drawings by Wesley 
R. Wilken vividly illustrate major 
points in each case and convey the suffer- 
ings and the utter futility of the life of 
the confirmed alcoholic. 
560 pages (6 x 9) 16 illustrations 


Published fall        Sent on approval, $9.50














PARAPSYCHOLOGY 


Frontier Science of the Mind 


By 
J. B. RHINE, Ph.D.,
and 
J. G. PRATT, Ph.D. 
Psychology Laboratory of 
Duke University 


A SURVEY OF THE FIELD, THE 
METHODS, AND THE RESULTS OF 
ESP AND PK RESEARCH 


Today students in the colleges want to find 
out what is known about psi capacity, and
even a few courses on parapsychology are 
being announced. To fill this need the 
authors have written a book that in itself 
introduces the reader to the field of para- 
psychology, defines the terms, describes 
the methods, and summarizes all the main 
facts accumulated to date. 


Provides for the convenience of profes- 
sional and research workers the basic 
techniques used in the tests on which most 
of the psi investigations have depended; 
and adds tables and measures for their 
appraisal of test results. 

It is READABLE; 

it is COMPREHENSIVE; 


it is USEFUL as a manual. 


Written in the world center of experi- 
mental research in the field it covers, the 
Parapsychology Laboratory of Duke Uni- 
versity. 


232 pages (6 x 9) 10 illustrations 
Published fall 1957        Sent on approval, $4.75
DEVEREUX SERVES


A full educational program, extending from kindergarten through 
the junior college level, is offered at Devereux for emotionally 
disturbed, as well as slow-learning children. 


Instruction is highly individualized and planned to take ad- 
vantage of the specialized learning techniques which are most 
effective for these children. 


The curricula of the Devereux units include a solid academic
program for college preparatory work; a vocational and commercial
program; and programs of artisanship or craftsmanship,
stressing the use of special tools and skills, either as preparation
for admission to a vocational or trade school or as a means of
attaining self-confidence and personality development.


Professional inquiries should be addressed to 
John M. Barclay, Director of Development, 
Devereux Schools, Devon, Pennsylvania; western 
residents address Keith A. Seaton, Registrar, 
Devereux Schools in California, Santa Barbara, California.